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I 

MORPHOGENIC PROTEIN SCREENING METHOD 

The invention relates to a method of screening 
drugs for the ability to modulate the level in mammals 
of proteins which can induce tissue morphogenesis and 
to methods of determining which animal tissue(s) and/or 
cell types within a tissue express a particular 
morphogenic protein. 

Background of the Invention 

Cell differentiation is the central characteristic 
of morphogenesis which initiates in the embryo, and 
continues to various degrees throughout the life of an 
organism in adult tissue repair and regeneration 
mechanisms. Members of the TGF-p superfamily include 
subfamilies of highly-related genes that now are 
suspected to play important roles in cell 
differentiation and morphogenesis during development 
and/or during adult life. For example, the Drosophila 
decapentaplegic gene product (DPP) has been implicated 
in formation of the dorsal-ventral axis in fruit flies; 
activins induce mesoderm and anterior structure 
formation in mammals; Miillerian inhibiting substance 
(MIS) may be required for male sex development in 
mammals; growth/differentiation factor- 1 (GDF-1) has 
been implicated in nerve development and maintenance; 
other morphogenic proteins (BMP-2, -3, -4 and 0P-1) 
induce bone formation. 
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The devel pment and study of a bone inducti n m deX 
system has identified the develo p m e ntal cascade of bone 
differentiation as consisting of chemotaxis of 
mesenchymal cells, proliferation of these progenitor 
cells, differentiation of cartilage, ossification and 
hypertrophy of this cartilaginous tissue, vascular 
invasion, bone formation, remodeling, and finally, 
marrow differentiation (Reddi (1981) Collagen Rel. Res. 
Is 209-206). This bone model system, which is studied 
in adult mammals, recapitulates the cascade of bone 
differentiation events that occur in formation of bone 
in the developing fetus. In other studies, the 
epithelium of the urinary bladder has been shown to 
induce new bone formation. Huggins (1931, Arch. Surg. 
22:377-408) showed that new bone formation could be 
induced by surgical transplantation of urinary bladder 
epithelium onto the parietal fascia. Urist (1965, 
Science 150:893-899) demonstrated that implantation Of 
demineralized bone segments resulted in endochondral 
bone formation. The latter study and observation 
suggested the existence of an osteogenic protein and 
that bovine diaphyseal bone Was a source of enriched 
preparations of osteogenic protein (Sampath et. al., J. 
Biol. Chem. 265:13198-13205, 1990; Urist, ibid? Reddi 
et al., Proc* Nat. Aca, Sci. 69:1601-1605, 1972? 
Sampath et al., Proc. Natl. Acad. Sci- 80:6591-6595, 
1983). Proteins capable of inducing endochondral bone 
formation in mammals when implanted in association with 
,a matrix now have been identified in a number of 
different mammalian species, as have the genes encoding 
these proteins, (see, for example, U.S. Patent No. 
4,968,590; U.S. S.N. 315,342 filed February 23, 1989? 
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and U.S. S.N. 599,543, filed Oct ber 18, 1990). Human 
OP-1 DNA has been cloned from vari us cDNA and genomic 
libraries using a consensus probe (Ozkaynak et al., 
EMBO J. 9:2085-2093, 1990). Purified human recombinant 
OP-1, expressed in mammalian cells, has been shown to 
induce new bone formation in vivo . Like other members 
of the TGF-0 superfamily, OP-1 is produced as a 
precursor, glycosylated, processed and secreted as a 
mature dimer. Mature OP-1 is cleaved at a maturation 
site following a sequence with the pattern of RXXR 
(Panganiban et al., Mol. Cell. Biol. 10:2669-2677, 
1990). 

The degree of morphogenesis in adult tissue varies 
among different tissues and depends on, among other 
factors, the degree of cell turnover in a given tissue. 
On this basis, tissues can be divided into three broad 
categories: 1) tissues with static cell populations 
such as nerve and skeletal muscle where there is little 
or no cell division and most of the cells formed during 
development persist throughout adult life and, 
therefore, possess little or no ability for normal 
regeneration after injury; 2) tissues containing 
conditionally renewing populations such as liver where 
there is generally little cell division but, in 
response to an appropriate stimulus or injury, cells 
can divide to produce daughters of the same 
differentiated cell type; and 3) tissues with 
permanently renewing populations including blood, bone, 
testes, and stratified squamous epithelia which are 
characterized by rapid and continuous cell turnover in 
the adult. Here, the terminally differentiated cells 
have a short life span and are replaced through 
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proliferati n f a distinct subpopulati n of cells, 
known as stem or progenitor cells. 

It is an object of this invention to provide a 
method of screening compounds which, when administered 
to a given tissue from a given organism, cause an 
alteration in the level of morphogenic protein 
("morphogen") produced by the tissue. Such compounds, 
when administered systemically, will result in altered 
systemic or local levels of morphogenic activity. This 
morphogenic activity includes the ability to induce 
proliferation and sequential differentiation of 
progenitor cells, and the ability to support and 
maintain the differentiated phenotype or sequence of 
phenotypes through the progression of events that 
results in the formation of normal adult tissue 
(including organ regeneration}* Thus, broadly, the 
invention provides a key to development of additional 
modalities of therapies involving modulation of 
morphogenic protein production in animals or adult 
mammals, e.g.", humans, and consequent correction of 
conditions involving pathologic alteration of the 
balance of tissue cell turnover. Another object of the 
invention is to provide methodologies for identifying 
or selecting a combination of compound(s) which may 
increase a progenitor cell population in a m a mma l/ 
stimulate progenitor cells to differentiate in vivo or 
in vitro , maintain the differentiated phenotype or 
sequence of phenotypes of a tissue, induce tissue- 
specific growth in vivo , or replace diseased or damaged 
tissues or organs in vivo . Another object of the 
invention is to determine the tissue (s) or organ (s) of 
origin of a given morphogen. Another object of the 
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invention is to d terrain the specific cell type(s) 
within the tissue(s) or organ(s) of origin, or cell 
line(s) derived from the tissue(s), or organ(s) of 
origin, that is responsible for the synthesis and 
production of a given morphogen. These and other 
objects and features of the invention will be apparent 
from the description, drawing, and claims which follow. 
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Summary of th Invention 
The invention features a method of screening 
candidate, compounds for the ability to modulate the 
effective local or systemic concentration or level of 
morphogenic protein in an organism. The method is 
practiced by incubating one or more candidate 
compound(s) with cells from a test tissue type of an 
organism known to produce a given morphogen for a time 
sufficient to allow the compaund(s) to affect the 
production, i.e., expression and/or secretion, of 
morphogen by the cells? and then assaying cells and the 
medium conditioned by the cells for a change in a 
parameter indicative of the level of production of the 
morphogenic protein. The procedure may be used to 
identify compounds showing promise as drugs for human 
use capable of increasing or decreasing morphogen 
production in vivo , thereby to correct or alleviate a 
diseased condition. 

In a related ^aspect, the invention features a 
method of screening tissue(s) of an organism to assess 
whether or at what level cells of the tissue(s) produce 
a particular morphogen, thereby to determine a 
tissue(s) of origin of the morphogen. This permits 
selection of the tissue cell type to be used in the 
screening. As used herein, "tissue" refers to a group 
of cells which are* naturally found associated, 
including an organ. 

As an example of tissue(s) or organ(s) which 
produce high levels of morphogen relative to the level 
produced by other types of tissues, it has been 
discovered that OP-1, first found in bone tissue is 
produced at relatively high levels in cells derived 
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from renal, .g., kidney or bladder, r adr nal tissue; 
that GDF-1 is produced at relatively high levels in 
cells derived from nerve, e.g., brain tissue; that DPP 
is produced at relatively high levels in cells derived 
from one of the following drosophila tissues: dorsal 
ectoderm, epithelial imaginal disc, visceral mesoderm, 
or gut endoderm; that vgr-l is produced at relatively 
high levels in cells derived from mouse lung tissue; 
and that Vgl is produced at relatively high levels in 
cells derived from xenopus fetal endoderm tissue. In 
addition, BMP 3 and CBHP2B transcripts have been 
identified in abundance in lung tissue. As used 
herein, "derived" means the cells are the cultured 
tissue itself, or are a cell line whose parent cells 
are the tissue itself. 

Preferred methods for determining the level of or a 
change in the level of a morphogen in a cultured cell 
include using an antibody specific for the morphogen, 
e.g., in an immunoassay such as an ELISA or 
radioimmunoassay; and determining the level of nucleic 
acid, most particularly mRNA, encoding the morphogen 
using a nucleic acid probe that hybridizes under 
stringent conditions with the morphogen RNA, such as in 
an RNA dot blot analysis. Where a change in the 
presence and/or concentration of morphogen is being 
determined, it will be necessary to measure and compare 
the levels of morphogen in the presence and absence of 
the candidate compound. The nucleic acid probe may 
be a nucleotide sequence encoding the morphogen or a 
fragment large enough to hybridize specifically only to 
RNA encoding a specific morphogen under stringent 
conditions. As used herein, "stringent conditions n are 
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d fin d as conditions in which non-specific hybrids 
will be eluted but at which specific hybrids will be 
maintained, i.e., incubation at 0.1X SSC (15mM NaCl, 
5inM Na citrate} at 50°C for 15 minutes. 

Examples of morphogens whose levels may be 
determined according to the invention include OP-1, OP- 
2, GDF-1, Vgr-1, DPP, 60A CBMP2A, CBMP2B, BMP 2, 3, 4, 
5, 6 V or Vgl. Thus, if an immunoassay is used to 
indicate the presence and/or concentration of a 
morphogen, an antibody specific for one of these 
morphogens only, and which will not detect the presence 
of other morphogens, will be used. Similarly, if 
nucleic acid hybridization is used to indicate the 
level of RNA encoding the morphogen, a nucleotide probe 
specific for one of these morphogens only will be used 
under hybridization conditions such that the probe 
should not be capable of hybridizing with RNA encoding 
a different morphogen. A morphogen includes an active 
C-terminal core region, which includes at least six 
cysteine residues, and a region N-terminal to the C- 
terminal region that is relatively non-homologous to 
the equivalent N-terminal regions of other morphogens . 
In addition, the 3' noncoding region of the mRNA is 
unique to each morphogen. Thus, a nucleic acid probe 
encoding all or a portion of the sequences N-terminal 
to the C-terminal core region of a morphogen, or 
encoding all or a portion of the sequences C-terminal 
to or 3' to the core region of a morphogen may be used 
jas a probe which detects mRNA encoding that morphogen 
only. 

-Morphogenic proteins" or "morphogens", as used 
herein, include naturally-occurring osteogenic proteins 
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capable of inducing the full dev 1 pmental cascad of 
bone formation, as well as polypeptide chains not 
normally associated with bone or bone formation, but 
sharing substantial sequence homology with osteogenic 
proteins. Such proteins, as well as DNA sequences 
encoding them, have been isolated and characterized for 
a number of different species. See. for example, U.S. 
Patent No. 4,968,590 and U.S. Patent Number. 5,011,691, 
U.S. application Serial Number 1989; 422,699, filed 
October 17, 1989, and 600,024 and 599,543, both filed 
October 18, 1990; Sampath et al., (1990) J. Biol. Chem. 
265:13198-13205; Ozkaynak et al. (1990) EMBO J. 9x2085- 
2093; and Lee, Proc. Nat. Aca. Sci. 88:42504254 (1991), 
all of which are hereby incorporated by reference. 
Many of these proteins subsequently were discovered to 
have utility beyond bone morphogenesis. See, e.g., 
USSN 667,274 filed March 11, 1991. The mature forms of 
morphogens share substantial amino acid sequence 
homology, especially in the C- terminal core regions of 
the proteins. In particular, most of the proteins 
share a seven-cysteine skeleton in this region, in 
addition to other apparently required amino acids. 
Table II, infra, shows the amino acid sequence 
homologies for nine morphogens over the carboxy 
terminal 102 amino acids. 

Among the morphogens useful in this invention are 
proteins originally identified as osteogenic proteins, 
such as the OP-1, OP-2 and CBMP2 proteins, as well as 
amino acid sequence-related proteins such as DPP (from 
Drosophila), Vgl ( f rom Xenopus ) , Vgr-1 (from mouse, see 
U.S. 5,011,691 to Oppexmann et al.), GDF-1 (from mouse, 
see Lee (1991) PNAS 88:4250-4254), all of which are 
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presented in Tabl II and Seq. ID Nos.5-14}, and the 
recently identified 60A protein (from Drosophila, Seq. 
ID No* 24, see Wharton et al. (1991) PNAS 
jl8: 9214-9218.) The members of this family, which 
include members of the TGF-p super-family of proteins, 
share substantial amino acid sequence homology in their 
C-terminal regions. The proteins are translated as a 
precursor, having an N-terminal signal peptide 
sequence, typically less than about 30 residues, 
followed by a "pro* domain that is cleaved to yield the 
mature sequence. The signal peptide is cleaved rapidly 
upon translation, at a cleavage site that can be 
predicted in a given sequence using the method of Von 
Heijne ((1986) Nucleic Acids Research 14:4683-4691.) 
Table I, below, describes the various morphogens 
identified to date, including their nomenclature as 
used herein, their Seq. ID references, and publication 
sources for the amino acid sequences for the full 
length proteins not included in the Seq. Listing. The 
disclosure of these publications is incorporated herein 
by reference. 

TABLE I 

"OP-1" refers generically to the group of 

morphogenically active proteins expressed 
from part or all of a DNA sequence 
encoding OP-1 protein, including allelic 
and species variants thereof, e.g., human 
OP-1 ("hOP-1", Seq. ID No. 5, mature 
protein amino acid sequence), or mouse 
OP-1 ("mOP-1", Seq. ID No. 6, mature 
protein amino acid sequence. ) The 
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cons rv d sev n cysteine sk leton is 
defined by residues 38 to 139 of Seq. ID 
Nos. 5 and 6* The cDNA sequences and the 
amino acids encoding the full length 
proteins are provided in Seq. Id Nos* 16 
and 17 (hOPl) and Seq* ID Nos* 18 and 19 
(mOPl.) The mature proteins are* defined 
by residues 293-431 (hOPl) and 292-430 
(mOPl). The "pro" regions of the 
proteins, cleaved to yield the mature, 
morphogenically active proteins are 
defined essentially by residues 30-292 
(hOPl) and residues 30-291 (mOPl). 

"OP-2" refers generically to the group of active 

proteins expressed from part or all of a 
DNA sequence encoding OP-2 protein, 
including allelic and species variants 
thereof, e.g., human OP-2 ("hOP-2", Seq* 
ID No. 7, mature protein amino acid 
sequence) or mouse OP-2 ("mOP-2", Seq* ID 
No. 8, mature protein amino acid 
sequence)* The conserved seven cysteine 
skeleton is defined by residues 38 to 139 
of Seq. ID Nos. 7 and 8. The cDNA 
sequences and the amino acids encoding the 
full length proteins are provided in Seq. 
ID Nos. 20 and 21 (hOP2) and Seq. ID Nos* 
22 and 23 (mOP2*) The mature proteins are 
defined essentially by residues 264-402 
(hOP2) and 261-399 (mOP2). The "pro" 
regions of the proteins, cleaved to yield 
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the mature^ morphogenically active 
proteins likely are defined essentially by 
residues 18-263 (hOP2) and residues 18-260 
(mOP2). (Another cleavage site also 
occurs 21 residues upstream for both OP-2 
proteins.) 

n CBMP2" refers generically to the morphogenically 

active proteins expressed from a part or 
all of a DMA sequence encoding the CBHP2 
proteins, including allelic and species 
variants thereof/ e.g., human CBHF2A 
("CBMP2A(faG) "/ Seq ID No. 9) or human 
CBMP2B DNA ( J, CBMP2B ( f at ) " f Seq. ID No. 10). 
The amino acid sequence for the full 
length proteins,, referred to in the 
literature as BMP2A and BMP2B, or BMP2 and 
BHP4, appear in Wozney, et al. (1988) 
Science 242:1528-1534. The pro domain for 
BMP2 (BHP2A) likely includes residues 25- 
248 or 25-282; the mature protein, 
residues 249-396 or 283-396* The pro 
domain for BMP4 (BMP2B) likely includes 
residues 25-256 or 25-292; the mature 
protein, residues 257-408 or 293-408. 

"DPP (fx) w . refers to protein sequences encoded by the 
Drosophila DPP gene and defining the 
conserved seven cysteine skeleton (Seq. ID 
No. 11). The amino acid sequence for the 
full length protein appears in Padgett, et 
al (1987) Nature 325 t 81-84 * The pro 
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domain likely extends from th signal 

peptide cleavage site to residue 456; the 
mature protein likely is defined by 
residues 457-588. 

"Vgl(fx)" refers to protein sequences encoded by the 
Xenopus Vgl gene and defining the 
conserved seven cysteine skeleton (Seq. ID 
No* 12). The amino acid sequence for the 
full length protein appears in 
Weeks (1987) Cell 51; 861-867. The pro 
domain likely extends from the signal 
peptide cleavage site to residue 246; the 
mature protein likely is defined by 
residues 247-360. 

•'Vgr-l(fx)" refers to protein sequences encoded by the 
murine Vgr-1 gene and defining the 
conserved seven cysteine skeleton (Seq. ID 
No. 13). The amino acid sequence for the 
full length protein appears in Lyons , et 
al, (1989) PNAS 86: 4554-4558. The pro 
domain likely extends from the signal 
peptide cleavage site to residue 299; the 
mature protein likely is defined by 
residues 300-438. 

"GDF-l(fx) " refers to protein sequences encoded by the 
human GDF-1 gene and defining the 
conserved seven cysteine skeleton (Seq. ID 
No. 14). The cDNA and encoded amino 
sequence for the full length protein is 
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provided in Seq. ID. No- 32. The pro 
domain likely extends from the signal 
peptide cleavage site to residue 214; the 
mature protein likely is defined by 
residues 215-372* 

"60A ,r refers genericallyto the morphogenically 

active proteins expressed from part or all 
of a DNA sequence (from the Drosophila 6 OA 
gene) encoding the 60A proteins (see Seq. 
ID No- 24 wherein the cDNA and encoded 
amino acid sequence for the full length 
protein is provided}. n 60A(fx) M refers to 
the protein sequences defining the 
conserved seven cysteine skeleton 
(residues 354 to 455 of Seq* ID No* 24.) 
The pro domain likely extends from the 
signal peptide cleavage site to residue 
324; the mature protein likely is defined 
by residues 325-455. 

"BMPSffx)" refers to protein sequences encoded by the 
human BMP3 gene and defining the conserved 
seven cysteine skeleton (Seq. ID No* 26). 
The amino acid sequence for the full 
length protein appears in wozney et al. 
(1988) Science 242 : 1528-1534. The pro 
domain likely extends from the signal 
peptide cleavage site to residue 290; the 
mature protein likely is defined by 
residues 291-472. 
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"BMP5(fx) w refers t pr t in seguenc s encoded by th 
human BMP5 gene and defining the conserved 
seven cysteine skeleton (Seq. ID No. 27). 
The amino acid sequence for the full 
length protein appears in Celeste, et al. 
(1991) PNAS 87: 9843-9847* The pro domain 
likely extends from the signal peptide 
cleavage site to residue 316; the mature 
protein likely is defined by residues 
317-454. 

"BMP6(fx)" refers to protein sequences encoded by the 
human BMP 6 gene and defining the conserved 
seven cysteine skeleton (Seq. ID No. 28). 
The amino acid sequence for the full 
length protein appear sin Celeste, et al. 
(1990) PNAS 87 x 9843-5847. The pro domain 
likely includes extends from the signal 
peptide cleavage site to residue 374; the 
mature sequence likely includes 
residues 375-513. 



The OP-2 proteins have an additional cysteine 
residue in this region (e.g., see residue 41 of Seq. ID 
Nos. 7 and 8), in addition to the conserved cysteine 
skeleton in common with the other proteins in this 
family. The GDF-1 protein has a four amino acid insert 
within the conserved skeleton (residues 44-47 of Seq. 
ID No. 14) but this insert likely does not interfere 
with the relationship of the cysteines in the folded 
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structure. In addition, the CBMP2 prot ins are missing 
one amino acid residue within the cysteine skeleton. 

The morphogens are inactive when reduced, but are 
active as oxidized hamodimers and when oxidized in 
combination with other morphogens of this invention. 
Thus, as defined herein, a morphogen is a dimeric 
protein comprising a pair of polypeptide chains, 
wherein each polypeptide chain comprises at least the 
C-terminal six cysteine skeleton defined by residues 
43-139 of Seq. ID No. 5, including functionally 
equivalent arrangements of these cysteines (e.g., amino 
acid insertions or deletions which alter the linear 
arrangement of the cysteines in the sequence but not 
their relationship in the folded structure), such that, 
when the polypeptide chains are folded, the dimeric 
protein species comprising the pair of polypeptide 
chains has the appropriate three-dimensional structure, 
including the appropriate intra- and inter-chain 
disulfide bonds such that the protein is capable of 
acting as a morphogen as defined herein. Specifically, 
the morphogens generally are capable of the following 
biological functions in a morphogenically permissive 
environment: stimulating proliferation of progenitor 
cells; stimulating the differentiation of progenitor 
cells; stimulating the proliferation of differentiated 
cells; and supporting the growth and maintenance of 
differentiated cells, including the "redifferentiation w 
pf transformed cells. In addition, it is also 
anticipated that these morphogens are capable of 
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inducing redif f erentiati n f committed cells under 
appropriate environmental conditions* 

Morphogens useful in this invention comprise one of 
two species of generic amino acid sequences: Generic 
Sequence 1 (Seq. ID No* 1) or Generic Sequence 2 (Seq. 
ID No* 2); where each Xaa indicates one of the 20 
naturally-occurring L- isomer, ct-amino acids or a 
derivative thereof. Generic Sequence 1 comprises the 
conserved six cysteine skeleton and Generic Sequence 2 
comprises the conserved six cysteine skeleton plus the 
additional cysteine identified in OP-2 (see residue 36, 
Seq. ID No* 2) . In another preferred aspect, these 
sequences further comprise the following additional 
sequence at their N- terminus: 

Cys Xaa Xaa Xaa Xaa (Seq* ID No. 15) 
1 5 

Preferred amino acid sequences within the foregoing 
generic sequences include: Generic Sequence 3 (Seq. ID 
No. 3), Generic Sequence 4 (Seq* ID No* 4), Generic 
Sequence 5 (Seq. ID No* 30) and Generic Sequence 6 
(Seq. ID No. 31), listed below. These Generic 
Sequences accommodate the homologies shared among the 
various preferred members of this morphogen family 
identified in Table II, as well as the amino acid 
sequence variation among them. Specifically, Generic 
Sequences 3 and 4 are composite amino acid sequences of 
the following proteins presented in Table II and 
identified in Seq. ID Nos. 5-14: human OP-1 (hOP-1, 
Seq. ID Nos. 5 and 16-17), mouse OP-1 (mOP-1, Seq. IP 
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Nos. 6 and 18-19), human and mouse OP-2 (Seq. ID 
Nos. 7, 8, and 20-22), CBMP2A (Seq. ID No. 9), CBMP2B 
(Seq. ID No. 10) , DPP (from Drosophila, Seq. ID 
No. 11) , Vgl, (from Xenopus, Seq. ID No. 12) , Vgr-1 
(from mouse, Seq, ID No. 13), and GDP-1 (from mouse, 
Seq. ID No. 14.) The generic sequences include both 
the amino acid identity shared by the sequences in 
Table II, as well as alternative residues for the 
variable positions within the sequence. Note that 
these generic sequences allow for an additional 
cysteine at position 41 or 46 in Generic Sequences 3 or 
4, respectively, providing an appropriate cysteine 
skeleton where inter- or intramolecular disulfide bonds 
can form, and contain certain critical amino acids 
which influence the tertiary structure of the proteins. 

Generic Sequence 3 
Leu Tyr Val Xaa Phe 

1 5 

Xaa Xaa Xaa Gly Trp Xaa Xaa Trp Xaa 

. 10 

Xaa Ala Pro Xaa Gly Xaa Xaa Ala 

15 20 
Xaa Tyr Cys Xaa Gly Xaa Cys Xaa 

25 30 
Xaa Pro Xaa Xaa Xaa Xaa Xaa 

35 
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Xaa Xaa Xaa Asn His Ala Xaa Xaa 

40 45 
Xaa Xaa Leu Xaa Xaa Xaa Xaa Xaa 

50 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Cys 

55 60 
Cys Xaa Pro Xaa Xaa Xaa Xaa Xaa 
65 

Xaa Xaa Xaa Leu Xaa Xaa Xaa 

70 75 
Xaa Xaa Xaa Xaa Val Xaa Leu Xaa 
80 

Xaa Xaa Xaa Xaa Met Xaa Val Xaa 

85 90 
Xaa Cys Gly Cys Xaa 
95 

wherein each Xaa is independently selected from a group 
of one or more specified amino acids defined as 
follows: "Res." means "residue" and Xaa at res .4 * 
(Ser, Asp or Glu); Xaa at res. 6 = (Arg, Gin, Ser or 
Lys); Xaa at res. 7 '= (Asp or Glu); Xaa at res. 8 = (Leu 
or Val); Xaa at res. 11 ■ (Gin, Leu, Asp, His or Asn); 
Xaa at res* 12 = (Asp, Arg or Asn); Xaa at res. 14 = (He 
or Val); Xaa at res. 15 = (lie or Val); Xaa at res. 18 * 
(Glu, Gin, Leu, Lys, Pro or Arg); Xaa at res. 20 * (Tyr 
or Phe); Xaa: at res. 21 - (Ala, Ser, Asp, Met, His, Leu 
or Gin); Xaa at res. 23 = (Tyr, Asn or Phe); Xaa at 
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res. 26 - (Glu, His, Tyr, Asp r Gin); Xaa at res.28 - 
(Glu, Lys, Asp or Gin) ? Xaa at res. 30 « (Ala, Ser, Pro 
or Gin)? Xaa at res. 31 = (Phe, Leu or Tyr); Xaa at 
res. 33- (Leu or Val); Xaa at res. 34 - (Asn, Asp, Ala 
or Thr)? Xaa at res -35 « (Ser, Asp, Glu, Leu or Ala); 
Xaa at res. 36 = (Tyr, Cys, His, Ser or lie); Xaa at 
res. 37 « (Met, Phe, Gly or Leu); Xaa at res. 38 - (Asn 
or Ser); Xaa at res. 39 = (Ala, Ser or Gly); Xaa at 
res. 40 = (Thr, Leu or Ser); Xaa at res.44 = (lie or 
Val)? Xaa at res. 45 - (Val or Leu); Xaa at res. 46 = 
(Gin or Arg); Xaa at res. 4? - (Thr, Ala or Ser); Xaa at 
res. 4 9 - (Val or Met); Xaa at res. 50 « (His or Asn); 
Xaa at res. 51 - (Phe, Leu, Asn, Ser, Ala or Val); Xaa 
at res. 52 « (He, Met, Asn, Ala or Val); Xaa at res. 53 
« (Asn, Lys, Ala or Glu); Xaa at res. 54 » (Pro or Ser); 
Xaa at res .55 = (Glu, Asp, Asn, or Gly); Xaa at res. 56 
* (Thr, Ala, Val, Lys, Asp, Tyr, Ser or Ala); Xaa at 
res. 57 - (Val, Ala or lie); Xaa at res. 58 - (Pro or 
Asp); Xaa at res. 5 9 « (Lys or Leu); Xaa at res. 60 = 
(Pro or Ala); Xaa at res. 63 = (Ala or Val)? Xaa at 
res. 65 ■ (Thr or Ala); Xaa at res. 66 = (Gin, Lys, Arg 
or Glu); Xaa at res. 67 » (Leu, Met or Val)? Xaa at 
res. 68 = (Asn, Ser or Asp)? Xaa at res. 69 = (Ala, Pro 
or Ser); Xaa at res. 70 - (lie, Thr or Val); Xaa at 
res. 71 « (Ser or Ala); Xaa at res. 72 » (Val or Met); 
Xaa at res. 74 - (Tyr or Phe); Xaa at res. 75 = (Phey Tyr 
or Leu); Xaa at res. 76 * (Asp or Asn); Xaa at res. 77 ■ 
(Asp, Glu, Asn or Ser)? Xaa at res. 78 «* (Ser, Gin, Asn 
or Tyr); Xaa at res. 79 = (Ser, Asn, Asp or Glu); Xaa at 
res. 80 » (Asn, Thr or Lysl? Xaa at res. 82 - (He or 
Val); Xaa at res. 84 * (Lys or Arg); Xaa at res. 85 ■ 
(Lys, Asn, Gin or His); Xaa at res.86 = (Tyr or His); 
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Xaa at r s.87 « (Arg, Gin or Glu)j Xaa at res. 88 = 
(Asn, Glu or Asp); Xaa at res .90 = (Val, Thr or Ala); 
Xaa at res. 92 » (Arg, Lys, Val, Asp or Glu); Xaa at 
res. 93 « (Ala, Gly or Glu); and Xaa at res. 97 » (His or 
Arg); 

Generic Sequence 4 

Cys Xaa Xaa Xaa Xaa Leu Tyr Val Xaa Phe 
1 5 10 

Xaa Xaa Xaa Gly Trp Xaa Xaa Trp Xaa 

15 

Xaa Ala Pro Xaa Gly Xaa Xaa Ala 

20 25 
Xaa Tyr Cys Xaa Gly Xaa Cys Xaa 
30 35 
Xaa Pro Xaa Xaa Xaa Xaa Xaa 

40 

Xaa Xaa Xaa Asn His Ala Xaa Xaa 
45 50 
Xaa Xaa Leu Xaa Xaa Xaa xaa Xaa 

55 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Cys 

60 65 
Cys Xaa Pro Xaa Xaa Xaa Xaa Xaa 
70 

Xaa Xaa Xaa Leu Xaa Xaa Xaa 

75 80 
Xaa Xaa Xaa Xaa Val Xaa Leu Xaa 
85 

Xaa Xaa Xaa Xaa Met Xaa Val Xaa 
90 95 
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xaa Cys Gly Cys Xaa 
100 

wherein each Xaa is independently selected from a group 
of one or more specified amino acids as defined by the 
following: "Res." means "residue" and Xaa at res. 2 » 
(Lys or Arg); Xaa at res. 3 « (Lys or Arg) ; Xaa at res. 4 
« (His or Arg); Xaa at res. 5 - (Glu, Ser, His., Gly, Arg 
or Pro); Xaa at res. 9 ». (Ser, Asp or Glu); Xaa at 
res. 11 ■ (Arg, Gin, Ser or Lys); Xaa at res. 12 « (Asp 
or Glu); Xaa at res. 13 - (Leu or Val); Xaa at res. 16 - 
(Gin, Leu, Asp, His or Asn); Xaa at res. 17 ■ (Asp, Arg, 
or Asn); Xaa at res. 19 - (lie or Val); Xaa at res. 20 - 
(lie or Val); Xaa at res. 23 « {Glu, Gin, Leu, Lys, Pro 
or Arg); Xaa at res. 25 - (Tyr or Phe); Xaa at res. 26 - 
(Ala, Ser, Asp, Met, His, Leu, or Gin); Xaa at res. 28 « 
(Tyr, Asn or Phe); Xaa at res. 31 = (Glu, His, Tyr, Asp 
or Gin); Xaa at res. 33 - Glu, Lys, Asp or Gin); Xaa at 
res. 35 = (Ala, Ser or Pro); Xaa at res. 36 - (Phe, Leu 
or Tyr); Xaa at res. 38 ■ (Leu or Val); Xaa at res. 39 - 
(Asn, Asp, Ala or Thr); Xaa at res. 40 » (Ser, Asp, Glu, 
Leu or Ala); Xaa at res. 41 - (Tyr, Cys, His, Ser or 
lie); Xaa at res. 42 - (Met, Phe, Gly or Leu); Xaa at 
res. 44 - (Ala, Ser or Gly); Xaa at res. 45 » (Thr, Leu 
or Ser); Xaa at res. 49 - (lie or Val); Xaa at res. 50 - 
(Val or Leu); Xaa at res .51 - (Gin or Arg); Xaa at 
res. 52 - (Thr, Ala'or Ser); Xaa at res. 54 - (Val or 
Met); Xaa at res. 55 » (His or Asn); Xaa at res. 56 - 
(Phe, Leu, Asn, Ser, Ala or Val); Xaa at res. 57 « (lie, 
Met, Asn, Ala or Val); xaa at res. 5 8 - (Asn, Lys, Ala 
or Glu); Xaa at res. 59 - (Pro or Ser); Xaa at res. 60 - 
(Glu, Asp, or Gly); Xaa at res. 61 •* (Thr, Ala, Val, 
Lys, Asp, Tyr, Ser or Ala); Xaa at res. 62 - (Val, Ala 
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or lie); Xaa at res .63 « (Pr or Asp); Xaa at r 8.64 « 
(Lys or Leu); Xaa at res. 65 * (Pro or Ala); Xaa at 
res. 68 = (Ala or Val); Xaa at res. 70 - (Thr or Ala); 
Xaa at res. 71 - (Gin, Lys, Arg or Glu); Xaa at res. 72 * 
(Leu, Met or Val); Xaa at res .73 » (Asn, Ser or Asp); 
Xaa at res .74 - (Ala, Pro or Ser); Xaa at res. 75 - 
(lie, Thr or Val); Xaa at res. 76 » (Ser or Ala); Xaa at 
res. 77 - (Val or Met); Xaa at res. 7 9 « (Tyr or Phe); 
Xaa at res .80 « (Phe, Tyr or Leu); Xaa at res .81 - (Asp 
or Asn); Xaa at res. 82 « (Asp, Glu, Asn or Ser); Xaa at 
res. 83 = (Ser, Gin, Asn or Tyr); Xaa at res. 84 * (Ser, 
Asn, Asp or Glu); Xaa at res. 85 » (Asn, Thr or Lys); 
Xaa at res. 87 * (lie or Val); Xaa at res. 8 9 = (Lys or 
Arg); Xaa at res. 90 * (Lys, Asn, Gin or His); Xaa at 
res .91 * (Tyr or His); Xaa at res .92 - (Arg, Gin or 
Glu); Xaa at res .93 = (Asn, Glu or Asp); Xaa at res .95 
» (Val, Thr or Ala); Xaa at res. 97 « (Arg, Lys, Val, 
Asp or Glu); Xaa at res .98 = (Ala, Gly or Glu); and Xaa 
at res. 102 - (His or Arg). 

Similarly, Generic Sequence 5 (Seq. ID No. 30) and 
Generic Sequence 6 (Seq. ID No. 31) accommodate the 
homologies shared among all the morphogen protein 
family members identified in Table II. Specifically, 
Generic Sequences 5 and 6 are composite amino acid 
sequences of human OP-1 (hOP-1, Seq* 10 Nos. 5 and 16- 
17), mouse OP-1 (mOP-1, Seq. ID Nos. 6 and 18-19), 
human and mouse OP-2 (Seq. ID Nos. 7, 8, and 20-22), 
-CBMP2A (Seq. ID No. 9), CBMP2B (Seq. ID No. 10), DPP 
(from Drosophila, Seq* ID No. 11), Vgl, (from Xenopus, 
Seq. ID No. 12), Vgr-1 (from mouse, Seq. ID No. 13), 
and GDF-1 (from mouse, Seq. ID No. 14), human BMP 3 
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(Seq. ID No. 26), human BMP5 (Seq. ID No. 27), human 
BMP6 (Seq. ID No. 28} and 60(A) (from Drosophila, Seq. 
ID Nos. 24-25). The generic sequences include both the 
amino acid identity shared by these sequences in the 
C-terminal domain, defined by the six and seven 
cysteine skeletons (Generic Sequences 5 and 6, 
respectively), as well as alternative residues for the 
variable positions within the sequence. As for Generic 
Sequences 3 and 4, Generic Sequences 5 and 6 allow for 
an additional cysteine at position 41 (Generic Sequence 
5) or position 46 (Generic Sequence 6), providing an 
appropriate cysteine skeleton where inter- or 
intramolecular disulfide bonds can form, and containing 
certain critical amino acids which influence the 
tertiary structure of the proteins. 

Generic Sequence 5 

Leu Xaa Xaa Xaa Phe 
1 5 
Xaa Xaa Xaa Gly Trp Xaa Xaa Trp Xaa 

10 

Xaa Xaa Pro Xaa Xaa Xaa Xaa Ala 

15 20 
Xaa Tyr Cys Xaa Gly Xaa Cys Xaa 

25 30 
Xaa Pro Xaa Xaa Xaa Xaa Xaa 

35 
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Xaa Xaa Xaa Asn His Ala Xaa Xaa 

40 45 
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

50 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Cys 

55 60 
Cys Xaa Pro Xaa Xaa Xaa Xaa Xaa 
65 

Xaa Xaa Xaa Leu Xaa Xaa Xaa 

70 75 
Xaa Xaa Xaa Xaa Val Xaa Leu Xaa 
80 

Xaa Xaa Xaa Xaa Met Xaa Val Xaa 

85 90 
Xaa Cys Xaa Cys Xaa 
95 

wherein each Xaa is independently selected from a group 
of one or more specified amino acids defined as 
follows: "Res." means "residue" and Xaa at res .2 * 
(Tyr or Lys); Xaa at res .3 = Val or lie); Xaa at res. 4 
= (Ser, Asp or Glu); Xaa at res. 6 = (Arg, Gin, Ser, Lys 
or Ala); Xaa at res. 7 = (Asp, Glu or Lys); Xaa at res .8 
= (Leu, Val or lie); Xaa at res. 11 « (Gin, Leu, Asp, 
His, Asn or Ser); Xaa at res. 12 - (Asp, Arg, Asn or 
Glu); Xaa at res .14 - (He or Val); Xaa at res. 15 - 
(lie or Val); Xaa at res. 16 (Ala or Ser); Xaa at res. 18 
- (Glu, Gin, Leu, Lys, Pro or Arg); Xaa at res. 19 ■ 
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(Gly or Ser); Xaa at r s.20 « (Tyr or Phe); Xaa at 
res. 21 = (Ala, Ser, Asp, Met, His, Gin, Leu or Gly); 
Xaa at res. 2 3 - (Tyr, Asn or Phe); Xaa at res. 2 6 » 
(Glu, His, Tyr, Asp, Gin or Ser); Xaa at res. 28 » (Glu, 
Lys, Asp, Gin or Ala); Xaa at res. 30 « (Ala, Ser, Pro, 
Gin or Asn); Xaa at res. 31 - (Phe, Leu or Tyr); Xaa at 
res. 33 = (Leu, Val or Met); Xaa at res. 34 * (Asn, Asp, 
Ala, Thr or Pro); Xaa at res.35 » (Ser, Asp, Glu, Leu, 
Ala or Lys); Xaa at res. 3 6 » (Tyr, Cys, His, Ser or 
lie); Xaa at res. 37 - (Met, Phe, Gly or Leu); Xaa at 
res. 38 » (Asn, Ser or Lys); Xaa at res. 39 » (Ala, Ser, 
Gly or Pro); Xaa at res. 40 » (Thr, Leu or Ser); Xaa at 
res. 44 - (lie, Val or Thr); Xaa at res. 45 - (Val, Leu 
or lie); Xaa at res. 46 - (Gin or Arg); Xaa at res. 47 « 
(Thr, Ala or Ser); Xaa at res.48 - (Leu or lie); Xaa at 
res. 49 - (Val or Met)? Xaa at res. SO - (His, Asn or 
Arg); Xaa at res. 51 * (Phe, Leu, Asn, Ser, Ala or Val); 
Xaa at res. 52 « (lie, Met, Asn, Ala, Val or Leu); Xaa 
at res. 53 - (Asn, Lys, Ala, Glu, Gly or Phe); Xaa at 
res. 54 - (Pro, Ser or Val); Xaa at res. 55 - (Glu, Asp, 
Asn, Gly, val or Lys); Xaa at res. 56 « (Thr, Ala, Val, 
Lys, Asp, Tyr, Ser, Ala, Pro or His); Xaa at res. 57 - 
(Val, Ala or lie); Xaa at res. 58 = (Pro or Asp); Xaa at 
res. 59 « (Lys, Leu or Glu); Xaa at res. 60 « (Pro or 
Ala); Xaa at res. 63 » (Ala or Val); Xaa at res. 65 * 
(Thr, Ala or Glu); Xaa at res. 66 « (Gin, Lys, Arg or 
Glu); Xaa at res. 67 - (Leu, Met or Val); Xaa at res. 68 
= (Asn, Ser, Asp or Gly); Xaa at res. 69 - (Ala, Pro or 
Ser); Xaa at res. 70 « (lie, Thr, Val or Leu); Xaa at 
res. 71 « (Ser, Ala or Pro); Xaa at res. 72 • (Val, Met 
or lie); Xaa at res. 74 - (Tyr or Phe); Xaa at res. 75 - 
(Phe, Tyr, Leu or His); Xaa at res. 76 - (Asp, Asn or 
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Leu); Xaa at r s.77 - (Asp, Glu, Asn or Ser); Xaa at 
res .78 « (Ser, Gin, Asn, Tyr or Asp); Xaa at res . 79 m 
(Ser, Asn, Asp, Glu or Lys); Xaa at res. 80 = (Asn, Thr 
or Lys); Xaa at res. 82 = (lie, Val or Asn); Xaa at 
res .84 = (Lys or Arg); Xaa at res ,85 * (Lys, Asn, Gin, 
His or Val); Xaa at res. 86 « (Tyr or His); Xaa at 
res .87 » (Arg, Gin, Glu or Pro); Xaa at res .88 « (Asn, 
Glu or Asp); Xaa at res. 90 « (Val, Thr, Ala or lie); 
Xaa at res. 92 ■ (Arg, Lys, Val, Asp or Glu); Xaa at 
res. 93 « (Ala, Gly, Glu or Ser); Xaa at res. 95 - (Gly 
or Ala) and Xaa at res. 97 » (His or Arg). 

Generic Sequence 6 



Cys Xaa Xaa Xaa Xaa Leu Xaa Xaa Xaa Phe 
15 10 
Xaa Xaa Xaa Gly Tip Xaa Xaa Trp Xaa 

15 

Xaa Xaa Pro Xaa Xaa Xaa Xaa Ala 

20 25 
Xaa Tyr, Cys Xaa Gly Xaa Cys Xaa 
30 35 
Xaa Pro Xaa Xaa Xaa Xaa Xaa 

40 

Xaa Xaa Xaa Asn His Ala Xaa Xaa 
45 50 
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

55 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Cys 

60 65 
Cys Xaa Pro Xaa Xaa Xaa Xaa Xaa 
70 
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xaa xaa Xaa Leu Xaa Xaa Xaa 

75 80 
Xaa Xaa Xaa Xaa Val Xaa Leu Xaa 
85 

Xaa Xaa Xaa Xaa Met Xaa Val Xaa 

90 95 
Xaa Cys Xaa Cys Xaa 
100 

wherein each Xaa is independently selected from a group 
of one or more specified amino acids as defined by the 
following: "Res." means "residue" and Xaa at res. 2 » 
(Lys, Arg, Ala or Gin); Xaa at res r 3 - (Lys, Arg or 
Met); Xaa at res. 4 - (His, Arg or Gin); Xaa at res. 5 - 
(Glu, Ser, His, Gly, Arg, Pro, Thr, or Tyr) ; Xaa at 
res. 7 - (Tyr or Lys); Xaa at res. 8 =» (Val or He); Xaa 
at res. 9 = (Ser, Asp or Glu); Xaa at res. 11 - (Arg, 
Gin, Ser, Lys or Alaf; Xaa at res. 12.= (Asp, Glu, or 
Lys); Xaa at res. 13 « (Leu, Val or He); Xaa at res. 16 
= (Gin, Leu, Asp, His, Asn or Ser); Xaa at res. 17 ■= 
(Asp, Arg, Asn or Glu); Xaa at res. 19 ■* (He or Val); 
Xaa at res. 20 = (lie or Val); Xaa at res. 21 = (Ala or 
Ser); Xaa at res. 23 = (Glu, Gin, Leii, Lys, Pro or Arg); 
Xaa at res. 24 « (Gly or Ser); Xaa at res. 25 - (Tyr or 
Phe); Xaa at res. 26 « (Ala, Ser, Asp, Met, His, Gin, 
Leu, or Gly); Xaa at res. 28 - (Tyr, Asn or Phe); Xaa at 
res. 31 - (Glu, His, Tyr, Asp, Gin or Ser); Xaa at 
res. 33 ■ Glu, Lys, Asp, Gin or Ala); Xaa at res. 35 « 
(Ala, Ser, Pro, Gin or Asn); Xaa at res. 36 « (Phe, Leu 
or Tyr); Xaa at res. 3 8 ■ (Leu, Val or Met); Xaa at 
res. 39 » (Asn, Asp, Ala, Thr or Pro); Xaa at res. 40 » 
(Ser, Asp, Glu, Leu, Ala or Lys); Xaa at res. 41 - (Tyr, 
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Cys, His, Ser or II ); Xaa at res .42 = (Met, Phe, Gly 
or Leu); Xaa at res .43 - (Asn, Ser or Lys); Xaa at 
res. 44 - (Ala, Ser, Gly or Pro); Xaa at res. 45 - (Thr, 
Leu or Ser); Xaa at res. 49 » (He, Val or Thr); Xaa at 
res. 50 « (Val, Leu or lie); Xaa at res. 51 - (Gin or 
Arg); Xaa at res. 52 * (Thr, Ala or Ser); Xaa at res. 53 
■ (Leu or lie); Xaa at res .54 ■ (Val or Net); Xaa at 
res. 55 - (His, Asn or Arg); Xaa at res. 5 6 - (Phe, Leu, 
Asn, Ser, Ala or Val); Xaa at res. 57 « (He, Met, Asn, 
Ala, Val or Leu); Xaa at res. 58 = (Asn, Lys, Ala, Glu, 
Gly or Phe); Xaa at res .59 - (Pro, Ser or Val); Xaa at 
res. 60 = (Glu, Asp, Gly, Val or Lys); Xaa at res. 61 - 
(Thr, Ala, Val, Lys, Asp, Tyr, Ser, Ala, Pro or His); 
Xaa at res .62 - (Val, Ala or He); Xaa at res .63 * (Pro 
or Asp); Xaa at res .64 - (Lys, Leu or Glu); Xaa at 
res .65 « (Pro or Ala); Xaa at res. 68 « (Ala or Val); 
Xaa at res. 70 » (Thr, Ala or Glu); Xaa at res. 71 * 
(Gin, Lys, Arg or Glu); Xaa at res .72 = (Leu, Met or 
Val); Xaa at res. 73 « (Asn, Ser, Asp or Gly); Xaa at 
res. 74 » (Ala, Fro or Ser); Xaa at res. 7 5 « (lie, Thr, 
Val or Leu); Xaa at res .76 = (Ser, Ala or Pro); Xaa at 
res. 77 « (Val, Met or He)? Xaa at res. 79 » (Tyr or 
Phe); Xaa at res .80 = (Phe, Tyr, Leu or His); Xaa at 
res. 81 » (Asp, Asn or Leu); Xaa at res. 82 = (Asp, Glu, 
Asn or Ser); Xaa at res. 8 3 = (Ser, Gin, Asn, Tyr or 
Asp); Xaa at res .84 = (Ser, Asn, Asp, Glu or Lys); Xaa 
at res .85 » (Asn, Thr or Lys); Xaa at res. 87 ■ (He, 
Val or Asn); Xaa at res. 8 9 » (Lys or Arg); Xaa at 
res. 90 = (Lys, Asn, Gin, His or Val); Xaa at res. 91 ■ 
(Tyr or His); Xaa at res. 92 « (Arg, Gin, Glu or Pro); 
Xaa at res. 93 - (Asn, Glu or Asp); Xaa at res. 95 = 
(Val, Thr, Ala or lie); Xaa at res.97 * (Arg, Lys, Val, 
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Asp or Glu); Xaa at res. 98 - {Ala, Gly, Glu or Ser); 
Xaa at res. 100 = (Gly or Ala>? and Xaa at res. 102 = 
(His or Arg) . 

Particularly useful sequences for use as 
morphogens in this invention include the C-terminal 
domains, e.g*/ the C-terminal 96-102 amino acid 
residues of Vgl, Vgr-1, DPP, OP-1, OP-2, CBMP-2A, 
CBMP-2B, GDP-1 (see Table II, below, and Seq. ID 
Nos. 5-14), as well as proteins comprising the 
C-terminal domains of 60A, BMP3, BMP5 and BMP 6 (see 
Seq. ID Nos. 24-28), all of which include at least the 
conserved six or seven cysteine skeleton. In addition, 
biosynthetic constructs designed from the generic 
sequences, such as COP-1, "3-5, 7, 16, disclosed in U.S* 
Pat. No. 5,011, 691, also are useful. Other sequences 
include the inhibins/activin proteins (see, for 
example, U-S. Pat. Nos. 4,968,590 and 5,011,691). 
Accordingly, other useful sequences are those sharing 
at least 70% amino acid sequence homology or 
"simjJLarity", and preferably 80% homology or similarity 
with any of the sequences above. These are anticipated 
to include allelic and species variants and mutants, 
and biosynthetic muteins, as well as novel members of 
this morphogknic family of proteins. Particularly 
envisioned in the family of related proteins are those 
proteins exhibiting morphogenic activity and wherein 
the amino acid changes from the preferred sequences 
include conservative changes, e.g., those as defined by 
Dayoff et al., Atlas of Protein Sequence and Structure ; 
vol. 5, Suppl. 3, pp. 345-362, (M.O. Dayoff , ed., Nat'l 
BioMed. Research Fdn., Washington, B.C. 1979). As used 
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herein, potentially useful s guences are aligned with a 
known morphogen sequence using the method of Needleman 
et al. ((1970) J.Mol.Biol. 48:443-453) and identities 
calculated by the Align program (DNAstar, Inc.). 
"Homology" or "similarity" as used herein includes 
allowed conservative changes as defined by Dayoff et 
al. 

Morphogen sequences which are detectable according 
to the methods of the invention include but are not 
limited to those having greater than 60% identity, 
preferably greater than 65% identity, with the amino 
acid sequence defining the conserved six cysteine 
skeleton of hOPl (e.g., residues 43-139 of Seq. ID No. 
5 ) • These most preferred sequences include both 
allelic and species variants of the OP-1 and OP-2 
proteins, including the Drosophila 60A protein. 
Accordingly, morphogens which are detectable according 
to the invention include active proteins comprising 
species of polypeptide chains having the generic amino 
acid sequence herein referred to as "OPX", which 
accommodates the homologies between the various 
identified species of OP1 and OP2 (Seq. ID No. 29). 

The morphogens detectable in the methods of this 
invention include proteins comprising any of the 
polypeptide chains described above, whether isolated 
from naturally-occurring sources, or produced by 
recombinant DNA or other synthetic techniques, and 
includes allelic and species variants of these 
proteins, naturally-occurring or biosynthetic mutants 
thereof, chimeric variants containing a domain(s) or 



WO 93/05172 



32 



PCT/US92/07359 



regi n(s) of ne^amily memb r functi nally arranged 
with another domain(s) or regions (s) of a second family 
member, as well as various truncated and fusion 
constructs. Deletion or insertion or ^addition mutants 
also are envisioned to be active, including those which 
may alter the conserved C-terminal cysteine skeleton, 
provided that the alteration dofes not functionally 
disrupt the relationship of these cysteines in the 
folded structure* Accordingly, such active forms are 
considered the equivalent of the specifically described 
constructs disclosed herein. The proteins may include 
forms having varying glycosylation patterns, varying 
N*-termini, a family of related proteins having regions 
of amino acid sequence homology, and active truncated 
or mutated forms of native or biosynthetic proteins, 
produced by expression of recombinant DNA in host" 
cells. 

The morphogenic proteins can be expressed from 
intact or truncated cDNA or from synthetic DNAs in 
procaryotic or eucaryotic host cells, and purified, 
cleaved, refolded, and dimerized to form 
morphogenically active compositions. Currently 
preferred host cells include coli or mammalian 
cells, such as CHO, COS or BSC cells. A detailed 
description of the morphogens detectable according to 
the methods of this invention is disclosed in copending 
US patent application Serial Nos. 752,764, filed August 
3a, 1991, and 667,274, filed March 11, 1991, the 
disclosure of which are incorporated herein by 
reference. 
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The scr ening m thod f the invent! n pr vides a 
simple method of determining a change in the level of 
morphogenic protein as a result of exposure of cultured 
cells to one or more compound(s). The level of a 
morphogenic protein in a given cell culture, or a 
change in that level resulting from exposure to one or 
more compound(s) indicates that direct application of 
the compound modulates the level of the morphogen 
expressed by the cultured cells. If, for example, a 
compound upregulated the production of OP-1 by a kidney 
cell line, it would then be desirable to test systemic 
administration of this compound in an animal model to 
determine if it upregulated the production of OP-1 in 
vivo . If this compound did upregulate the endogenous 
circulating levels of it would be consistent with 

administration of the compound systemically for the 
purpose of correcting bone metabolism diseases such as 
osteoporosis. The level of morphogen in the body may 
be a result of a wide range of physical conditions, 
e.g., tissue degeneration such as occurs in diseases 
including arthritis, emphysema, osteoporosis, kidney 
diseases, lung diseases, cardiomyopathy, and cirrhosis 
of the liver. The level of morphogens in the body may 
also occur as a result of the normal process of aging. 
A compound selected by the screening method of the 
invention as, for example, one which increases the 
level of morphogen in a tissue, may be consistent with 
the administration of the compound systemically or 
locally to a tissue for the purpose of preventing some 
form of tissue degeneration or for restoring the 
degenerated tissue to its normal healthy level. 
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Oth r advantages of the invention, includ 
determining the tissue or tissues of origin of a given 
morphogen in order to administer a compound aimed at 
modulating the systemic level, of morphogen for 
treatment of a disease or condition in which the level 
of morphogen production has become altered. 
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Brief Description of the Drawings 
Fig. 1 shows the fragments of OF-l, used as probes 

in Northern hybridizations useful in the processes of 

the invention* 

Fig. 2 shows results of Northern blot analysis of 

RNA using different OP-l-specif ic probes. 

Fig. 3 shows results of Northern blot analysis of 

RNA from different cells types probed with an OP-1, 

probe. 
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Detailed Descripti n 
The invention is based on the discovery of a family 
of structurally related morphogenic proteins (HHPs), also 
called osteogenic proteins (OPs), and more particularly 
that various of these proteins play an important role# not 
only in embryogenesis, but also in tissue and organ 
maintenance and repair in juvenile and adult mammals . 
Morphogenic proteins which have been identified include BMP 
2, 3, 4/ 5, 6/ OP-1 and OP-2 (murine and human)/ Vgr-1, 
Vgl/ DPP/ GDF-1, CMBP-2A/ CMBP-2B/ 60A, and the 
inhibin/activin class of proteins. Other recombinant 
proteins include C0P1, COP3/ COP4/ C0P5 r COP7/ and COP 16. 
While/ as explained herein, the morphogen have significant 
homologies and similarities in structure/ it is 
hypothesized that variants within the morphogenic protein 
genes may have specific roles in specific tissue involving/ 
for example, stimulation of progenitor cell multiplication/ 
tissue specific or tissue preferred phenotype maintenance/ 
and/or stimulation or modulation of the rate of 
differentiation/ growth or replication of tissue cells 
characterized by high turnover* The effect on the long- 
term physiology/ maintenance and repair of particular 
tissues by particular species of the morphogens is 
currently unknown in any significant detail. However/ 
methods useful in determining which particular tissues 
express which particular morphogen(s) , and for finding 
changes which stimulate or depress morphogen expression in 
vivo , would enable discovery and development of strategies 
for therapeutic treatment of a large number of diseased 
states, and provide drugs designed to implement the 
strategy. 
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This inventi n pr vides such methods and, more 
specifically, two generic processes for obtaining data 
which ultimately will permit determination of 
structure/activity relationships of specific naturally 
occurring mammalian morphogens and drugs capable of 
modulating their production. For example, using the assay 
of the invention, it has been determined that OP-1, first 
found in bone and demonstrated to be osteoinductive, is 
synthesized primarily in kidney, bladder, and adrenal 
tissue. This surprising discovery, coupled with the 
observation that patients with kidney disease often express 
loss of bone mass, suggests that the bone loss in these 
patients may be due to pathologic depression of OP-1 
synthesis in kidney, and suggests that administration of 
OP-1 systemically or stimulation of OP-1 expression and 
secretion by the kidney may arrest bone loss, or effect 
remineralization through increased bone formation (i.e., 

osteogenesis) . j 

There are two fundamental aspects bf the invention. 
One aspect involves an assay to determine tissues and cell 
types capable of synthesis and secretion of the morphogens; 
the other involves the use of the identified cell types 
configured in a screening system to find substances useful 
therapeutically to modulate, i.e., stimulate or depress, 
morphogen expression and/or secretion. 

The assay to determine the tissue of origin of a 
given morphogen involves screening a plurality (i»e., two 
or more) different tissues by determining a parameter 
indicative of production of a morphogen in the tissue, and 
comparing the parameters. The tissue(s) of origin will, of 
course, be the tissue that produces that morphogen. 
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The other assay f th invention involves screening 
candidate compounds for their ability to modulate the 
effective systemic or local concentration of a morphogen by 
incubating the compound with a cell culture that produces 
the morphogen, and assaying the culture for a parameter 
indicative of a change in the production level of the 
morphogen. Useful candidate compounds then may be tested 
for in vivo efficacy in a suitable animal model. These 
compounds tlxen may be used in vivo to modulate effective 
morphogen concentrating in the disease treatment. 

1. Morphogen Tissue Distribution 

Horphogens are broadly distributed in developing and 
adult tissue* For example, DPP and 60A are expressed in 
both embryonic and developing Drosophila tissue. Vgl has 
been identified in Xenopus embryonic tissue. Vgr-1 
transcripts have been identif ied in a variety of murine 
tissues, including embryonic and developing brain, lung, 
liver, kidney and calvaria (dermal bone) tissue. In 
addition^ both CBMP2B and CBMP3 have been identified in 
lung tissue. Recently, Vgr-1 transcripts also have been 
identified in adult murine lung, kidney, heart, and brain 
tissue, with particularly high levels in the lung (see 
infra). GDF-1 has been identified in human adult 
cerebellum and in fetal brain tissue. In addition, recent 
Northern blot analyses indicate that OP-1 is encoded by 
multiple transcripts in different tissues. This potential 
alternative splicing is consistent with the hypothesis that 
the longer transcripts may encoded additional proteins 
(e.g., bicistronic mRNA) and each form may be tissue or 
developmentally related* 
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OP-1 and the CBMP2 pr teins, both first id ntified 
as bone m rphogens, have been identified in mouse and human 
placenta, hippocampus, calvaria and osteosarcoma tissue as 
determined by identification of OP-1 and CMBP2-specif ic 
sequences in cDNA libraries constructed from these tissues 
(see USSN 422,699, incorporated herein by reference)* 
Additionally, the OP-1 protein is present in a variety of 
embryonic and developing tissues including kidney, liver, 
heart and brain as determined by Western blot analysis and 
imraunolocalization (see infra). OP-l-specific transcripts 
also have been identified in both embryonic and developing 
tissues, most abundantly in developing kidney, bladder, 
adrenal and (see infra). OP-1 also has been identified as 
a mesoderm inducing factor present during embryogenesis. 
Moreover, OP-1 has been shown to be associated with 
satellite cells in the muscle and associated with potential 
pluripotential stem cells in bone marrow following damage 
to adult murine endochondral bone, indicating its 
morphogenic role in tissue repair and regeneration. In 
addition, a novel protein GDF-1 comprising a 7 cysteine 
skeleton, has been identified in neural tissue (Lee, 1991, 
Proc. Nat. Aca. Sci. 88: 4250-4254). 

Knowledge of the tissue distribution of a given 
morphogen may be useful in choosing a cell type for 
screening according to the invention, or for targeting that 
cell type or tissue type for treatment. Xhe proteins (or 
their mRNA transcripts) are readily identified in different 
tissues using standard methodologies and minor 
modifications thereof in tissues where expression may be 
low. For example, protein distribution may be determined 
using standard Western blot analysis or immunocytochemical 
techniques, and antibodies specific to the morphogen or 
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morphogens of interest* Similarly, the distribution of 
morphogen transcripts may be determined using standard 
Northern hybridization protocols and a transcript-specific 
probe and hybridization conditions. 

2. Useful Morphogens 

As defined herein a protein is morphogenic if it is 
capable of inducing the developmental cascade of cellular 
and molecular events that culminate in the formation of 
new, organ-specific tissue and comprises at least the 
conserved C-terminal six cysteine skeleton or its 
functional equivalent (see supra*. Specifically, the 
morphogens generally are capable of all of the following 
biological functions in a morphogenically permissive 
environment: stimulating proliferation of progenitor 
cells; stimulating the differentiation of progenitor cells; 
stimulating the proliferation of differentiated cells; and 
supporting the growth aind maintenance of differentiated 
cells, including the "redifferentiation" of transformed 
cells. Details of how the morphogens detectable according 
to the methods of this invention first were identified, as 
well as a description on how to make, use and test them for 
morphogenic activity are disclosed in CSSN 667,274, filed 
March 11, 1991 and USSN 752,764, filed August 30, 1991, the 
disclosures of which are hereby incorporated by reference* 
As disclosed therein, the morphogens may be purified from 
naturally-sourced material or recombinantly produced from 
procaryotic or eucaryotic host cells, using the genetic 
sequences disclosed therein. Alternatively, novel 
morphogenic sequences may be identified following the 
procedures disclosed therein. 
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Particularly useful pr teins Includ th s which 
c mprise the naturally derived sequences disci sed in Table 
II. Other useful sequences include biosynthetic constructs 
such as those disclosed in U.S. Pat. 5,011,691, the 
disclosure of which is incorporated herein by reference 
(e.g., COP-1, COP-3, COP-4, COP-5, COP-7, and COP-16). 

Accordingly, the morphogens detectable according to 
the methods and compositions of this invention also may be 
described by morphogenically active proteins having amino 
acid sequences sharing 70% or, preferably, 80% homology 
(similarity) with any of the sequences described above, 
where "homology" is as defined herein above. 

The morphogens detectable according to the method of 
this invention also can be described by any of the 6 
generic sequences described herein (Generic Sequences 1, 2, 
3, 4, 5 and 6). Generic sequences 1 and 2 also may 
include, at their N- terminus, the sequence 

Cys Xaa Xaa Xaa Xaa (Seq. ID No. 15) 
1 5 

Table II, set forth below, compares the amino acid 
sequences of the active regions of native proteins that 
have been identified as morphogens, including human OP-1 
(hOP-1, Seq. ID Nos. 5 and 16-17), mouse OP-1 (mOP-1, Seq. 
ID Nos. 6 and 18-19), human and mouse OP-2 (Seq. ID Nos. 7, 
8, and 20-23), CBMP2A (Seq* ID No. 9), CBMP2B (Seq. ID 
No. 10), BMP 3 (Seq. ID No. 26), DPP (from Drosophila, Seq. 
ID No. 11), Vgl, (from Xenopus, Seq. ID No. 12), Vgr-1 
(from mouse, Seq. ID No. 13), GDF-1 (from mouse, Seq. ID 
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Nos. 14, 32 and 33), 6 OA protein (from Dros phila, Seq. ID 
Nos. 24 and 25), BMP5 (Seq. ID Ho. 27) and BMP6 (Seq. ID 
No. 28). The sequences are aligned essentially following 
the method of Needleman et al. (1970) J. Mol. Biol., 
48:443-453, calculated using the Align Program (DNAstar, 
Inc.) In the table, three dots indicates that the amino 
acid in that position is the same as the amino acid in 
hOP-1. Three dashes indicates that no amino acid is 
present in that position, and are included for purposes of 
illustrating homologies. For example, amino acid residue 
60 of CBMP-2A and CBMP-2B is "missing". Of course, both 
these amino acid sequences in this region comprise Asn-Ser 
(residues 58, 59), with CBMP-2A then comprising Lys and 
lie, whereas CBMP-2B comprises Ser and lie. 
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TABLE II 



bOP-1 


Qys 


Lys 


Lys 


His 


Glu 


Leu 


Tjrr 


Val 




oOP-1 


• * • 


... 


• • . 


• . * 


... 


... 


• • • 


... 




hOF-2 


• * • 


Arg 


Arg 


» • • 


... 


... 


• • • 


... 




mOP-2 




Arg 


Arg 


... 


« » . 


... 


• . 4 


• * • 




DPP 




Arg 


Arg 


... 


Ser 


♦ . . 




... 




Vgl 






Lys 


Arg 


His 










Vgr-1 


... 


... 


... 


... 


Gly 


... 


... 


... 




CBMP-2A 




* * • 


Arg 


• . . 


Pro 




... 






CBMP-2B 


... 


Arg 


Arg " 


. • 


Ser 


... 


... 


... 




BHP3 


... 


Ala 


Arg 


Arg 


Tyr 


... 


Lys 


... 




GDF-1 




Arg 


Ala 


Arg 


Arg 


... 


... 






60A 




Gin 


Met 


Glu 


Thr 


... 


... 






BMP5 


... 


... 


... 


... 


» • • 




. . * 


... 




BMP 6 




Arg 


... 


... 


... 


... 


* * . 


... 






1 








5 










hOP-1 


Ser 


Phe 


Arg 


Asp 


Leu 


Gly 


Trp 


Gin 


Asp 


mOP-1 


• * • 




4 . . 




... 






... 


... 


hOP-2 


... 


... 


Gin 


... 


... 


... 


... 


Leu 


• • . 


mOP-2 


Ser 




... 




... 






Leu 


... 


DPP 


Asp 




Ser 




Val 






Asp 


... 


Vgl 


Glu 




Lys 




Val 






... 


Asn 


Vgr-1 






Gin 




Val 






... 


... 


CBHP-2A 


Asp 




Ser 




Val 






Asn 




CBHP-2B 


Asp 




Ser 




Val 






Asn 


... 


BMP3 


Asp 




Ala 




lie 






Ser 


Glu 


GDF-1 


... 




« * • 


Glu 


Val 






His 


Arg 


60A 


Asp 




Lys 




... 






His 


* • • 
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• • • • * 



BMP5 

BMP 6 Gin - 

10 15 
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hOP-1 

mOP-1 

hOP-2 

roOP-2 

DPP 

Vgl 

Vgr-1 

CBHP-2A 

CBMP-2B 

BMP3 

GDF-1 

60A 

BMP5 

BMP6 



Trp lie lie Ala Pro Glu Gly Tyx Ala 



• • mi 



Val 
Val 

• • • 

Val 



Val 



Val 



Val 
Val 



Ser 



i • • • • 



••• 



i • « ■ • • 



Gin 
Gin 
Leu 
Gin 
Lys 
Pro 
Pro 



Lys 



Ser 
Ser 
Asp 
Met 

• • • 

His 
Gin 



Lys Ser Phe Asp 
Arg ... Pbe Leu 

••» 



20 



25 



hOP-1 

mOP-1 

hOP-2 

raOP-2 

DPP 

Vgl 

Vgr-1 

CBHP-2A 

CBHP-2B 

BHP3 

GDF-1 

60A 

BMP5 

BMP 6 



Ala Tyr Tyr Cys Glu Gly Glu Cys Ala 



Asn 
Asn 
Phe 
Phe 

• • 

Asn 
Phe 
Phe 
Asn 



His 
Tyr 
Asp 
His 
His 
Ser 
Gin 
Ser 
Asp 
Asp 



Lys 



Glu 
Asp 
Ala 
Gin 



30 



Ser 

* * • 

Pro 
Pro 
Ser 
Pro 
Pro 
Gin 

• • • 

Asn 
Ser 
Ser 
35 
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hOP-1 


Phe 


Pro 


Leu 


























DPP 
















Vgr-1 


• • • 


• •.» 


• * • 




* • • 


• #- • 




CBHP-2B 


• • * 


• • • 


w • • 




Lett 




Val 


RHP 3 


• • • 




Het 


DUa 


• • • 






BMP 5 








BHP6 








hOP-1 


Thr 


Asn 


His 


tnOP-1 
iiiur i 


• • • 




* * * 


hOP -2 








mOP-2 








npp 


• • • 






Vcrl 
















CBHP-ZA 


» «- • 


• • • 


• • * 


CBMP-2B 


• • • 


• * • 


• » • 


HHP3 


Ser 




• * • 


GDF-1 


Leu 


• • • 


» * * 


60A 


• • • 


« • • 




BMP5 




• * • 


• • m 


BMP 6 


• • 


• • • 


• • • 
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Asn 


Ser 


Tyr 


Het 


Asn 


Ala 


Asp 


• » • 


Cys 


• • 

• « • 


• » • 

• • • 


• • • 

♦ • ♦ 


Asp 

* 


• • • 


Cys 


• • * 


• • • 


• • • 


Ala 


Asp 


His 


Phe 


• • • 


Ser 


Thr 


Glu 


He 


Leu 


• • • 


Gly 




Ala 


His 






• # • 


Ala 


Asp 


His 


Leu 


• * • 


Ser 


Ala 


Asp 


His 


Leu 


• • • 


Ser 


Ala 


Leu 


Ser 


Gly 


Ser** 


• • •• 


Pro 


Lys 


Ser 


Leu 


Lys 


Pro 




Ala 


His 






• • • 




Ala 


His 


Het 


• • • 


• • -* 


• • # 


Ala 


His 


Het 


• » «- 


• • • 




40 










Ala 


lie 


Val 


Gin 


Thr 


Leu 


• • • 


• •■ • 


• • • 

Leu 


• • • 

• * • 


m m m 

Ser 


• • • 

• • • 


• • «• 


* * 


Leu 


• « • 


Ser 


• » • 


• • • 


Val 


• • 




• • • 


• * * 


• • • 

• • •■ 


• • « 

• • • * 


Leu 

• • *■ 


• • • 


* * * 


• • 


• • • 

• • • 

• • » 


* « * 

• • • 

Thr 


• ♦ • 

• • • 
lie 


• • * 

• • • 

• • * 


• • • 

« • • 

Ser 


» • • 
• • » 

lie 


• • • 


Val 


Leu 


Arg 


Ala 


• • • 


* • • 


• ■ • 


» • * 


• * * 


• • • 


» • • 


♦ • • 


♦ m 


• * • 


• • 


• « « 


• • • 


• • « 


• v * 


• • • 


• * * 


• • * 


■ • • 
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hOP-1 

mOP-1 

hOP-2 

mOP-2 

DPP 

Vgl 

Vgr-1 

CBMP-2A 

CBHP-2B 

BMP3 

GDP-1 

60A 

BMP5 

BMP 6 



Val 



hOP-1 

mOP-1 

hOP-2 

nOP-2 

DPP 

Vgl 

Vgr-1 

CBMP-2A 

CBMP-2B 

BHP3 

GDF-1 

60A 

BMP5 

BMP6 



Met 



His 


Phe 


lie 


Asn 


Pro 


. Glu 


Thr 


Val 


• • • 


• • • 


* • • 


« • • 


• • * 


Asp 


• • • 


• • • 


His 


Leu 


Met 


Lys 


• • • 


Asn 


Ala 


• • • 


His 


Leu 


Met 


Lys* 


• • • 


Asp 


Val 


• • • 


Asn 


Asn 


Asn 


• • • 


• • * 


Gly 


Lys 


• • • 


* • • 


Ser 


* ♦ • 


Glu 


• * • 


• • « 


Asp 


lie 


• • • 


Val 


Met 


• • • 


• • • 


• • • 


Tyr 


• • • 


Asn 


Ser 


Val 


* • ♦ 


Ser 


— 


Lys 


He 


Asn 


Ser 


Val 


• • • 


Ser 




Ser 


He 


Arg 


Ala** 


Gly 


Val 


Val 


Pro 


Gly 


He 


* * • 


Ala 


Ala 


Ala 


• • • 


Gly 


Ala 


Ala 


• • • 


Leu 


Leu 


GXU 


• • • 


Lys 


Lys 


• • • 




Leu 


Met 


Phe 




Asp 


His 


• * * 


• • • 


Leu 


Het 


• • • 


• • • 


• mm 


Tyr 


• • • 


55 










60 






Lys 


Pro 


Cys 


Cys 


Ala 


Pro 


Thr 


Gin 


• • • 

• • • 


• • • 
Ala 


• • • 

• • • 


* ■ • 

Oft* 


* • • 

• • • 


» • • 
• • • 


• • • 

• • • 


• • • 

Lys 


• • • 


Ala 


• * • 


• ft ft 


• • • 


* • • 


*- * » 


Lys 


• • * 


Ala 


• * • 


a m « 


Val 


• • • 


• • • 


• • • 


Leu 


• • • 


• • • 


• ft ♦ 


Val 


• • • 


• • • 


Lys 


• • • 


• « • 


• • • 


• ft • 


• • • 


• ♦ • 


• • • 


Lys 


• • • 


Ala 


* • • 




Val 


* * * 


• « • 


Glu 


• • « 


Ala 


• • • 




Val 


• • • 


• m • 


Glu 


Glu 


• • • 


• • • 




Val 




Glu 


Lys 


Leu 


• • • 


• • * 


• • • 


Val 


• • • 


Ala 


Arg 


• • • 




• • • 




• • « 


• • • 


ft • ft 


Arg 






• • * 






• • • 


ft * ft 


Lys 


• • • 




• • • 






• • • 


ft ft- ft 


Lys 




65 










70 
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hOP-1 


Leu 


Asn 


Ala 


He 


Ser 


Val 


Leu 


Tyr 


Ph 


mOP-1 




• • * 








* • ♦ 




• • • 




hOP-2 




Ser 


• m • 


Thr 


• • * 


• • • 


• • • 


• . • 


Tyr 


mOP-2 


* . • 


Ser 


• * 0 


Thr 




mm*- 


• « * 


• • • 


Tyr 


Vgl 


Met 


Ser 


Pro 


• • • 


• * • 


Het 


• * ♦ 


Phe 


Tyr 


Vgr-1 


Val 


• •■ *> 


* • * 


• • • 


• • • 


• . • 


# • • 


• • * 


•> • • 


DPP 


• «■ • 


Asp 


Ser 


Val 


Ala 


Het 


• • • 


• • • 


Leu 


CBMP-2A 


m * • 


Ser 


m r w 


* * * 


• * • 


Het 


• • » 


• • • 


Leu 


CBHP-2B 


• • * 


Ser 


. •- * 




• » • 


Het 


*■ * • 


* » * 


Leu 


BHP3 


Met 


Ser 


Ser 


Leu 


• » • 


lie 


• 0 • 


Phe 


Tyr 


GDF-1 


• • 


Ser 


Pro 


» • 


• • *■ 


• • • 


•> • • 


Phe 


• • • 


60A 




Gly 


• • • 


Leu 


Pro 


• * •< 


... 


• • • 


His 


BHP5 




• • 


*■ w m 


• * * 


•- • • 


•r • • - 


• mm 


• • • 


• • • 


BMP6 




• • » 


• a a 


* • • 




• • * 


m -m • 


. • • 


* « * 










75 










80 


hOP-1 


Asp 


Asp 


Ser 


Ser 


Asn 


Val 


lie 


Leu 


Lys 


mOP-1 




• • • 


*■ » • 


* • • 


• • 


•. • • 


• *■ • 


• « « 


• • • 


hOP-2 




Ser 


• * • 


Asn 




• «- • 


• •- • 


* * * 


Arg 


mOP-2 




Ser 


• •* • 


Asn 


• •> • 


• • • 


m » m 


• * * 


Arg 


DPP 


Asn 




Gin 


• • • 


Thr 


• » * 


Val 


• - m 


... 


Vgl 


• • • 


Asn 


Asn 


Asp 


• •> • 


• • • 


Val 


... 


Arg 


Vgr-1 


... 


* • • 


Asn 


* * • 


♦ • • 


• • • 


• • • 




. » * 


CBHP-2A 


* • « 


Glu 


Asn 


Glu 


Lys 


• • • 


Val 






CBHP-2B 


« • • 


Glu 


Tyr 


Asp 


Lys 


• • • 


Val 




. . » 


BHP3 


* * • 


Glu 


Asn 


Lys 




• • • 


Val 




• • • 


GDF-1 


. » • 


Asn 


• • • 


Asp 


• «• • 


• • • 


Val 




Arg 


60A 


Leu 


Asn 


Asp 


Glu 


• • • 


• * * 


Asn 




• • • 


•1HP5 


* * • 


m- m m 


• • * 


* • m 


• • • 


• • • 


• • • 




♦> • • 



BMP6 ... Asn 



WO 93/05172 



49 



PCT/US92/07359 



hOP-1 
nOP-1 
hOP-2 
mOP-2 
DPP 

Vgl 

Vgr-1 

CBMP-2A 

CBHP-2B 

BMP3 

GDF-1 

60A 

BMP5 

BHP6 



Lys Tyr Arg Asn Met Val Val Arg 



Asn 

His 

• • * 

Asn 
Asn 
Val 
Gin 



His 
His 



Gin 
Glu 



Glu 



Gin Asp 

Gin Glu 
Pro 

Glu Asp 



Trp 



Thr 
Ala 



90 



Thr 
lie 

• * • 

• • • 

95 



Lys 
Lys 
Val 
Asp 
• • • 
Glu 
Glu 
Glu 
Asp 
Lys 



hOP-1 

mOP-1 

hOP-2 

raOP-2 

DPP 

Vgl 

Vgr-1 

CBHP-2A 

CBMP-2B 

BHP3 

GDF-1 

60A 

BMP5 

BMP 6 



Ala Cys Gly Cys His 



Gly 
Glu 

• • * 

Gly 
Gly 
Ser 
Glu 
Ser 
Ser 



Ala 



Arg 
Arg 
• * * 
Arg 
Arg 
Arg 
Arg 



100 



WO 93/05172 



50 



PCT/US92/07359 



★♦Between residues 56 and 57 £ BMP3 is a Val x sidu ; 

between residues 43 and 44 of GDF-1 lies 
the amino acid sequence Gly-Gly-Pro-Pro. 
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As is appar nt from th £ r g ing amino acid 
sequence comparisons, significant amino acid changes 
can be made within the generic sequences while 
retaining the morphogenic activity. For example, while 
the GDF-1 protein sequence depicted in Table II shares 
only about 50% amino acid identity with the hOPl 
sequence described therein, the GDF-1 sequence shares 
greater than 70% amino acid sequence homology (or 
"similarity") with the hOPl sequence, where "homology" 
or "similarity" includes allowed conservative amino 
acid changes within the sequence as defined by Dayoff , 
et al.. Atlas of Protein Sequence and Structure vol.5, 
supp.3, pp. 345-362, (M.O. Dayoff, ed. , Nat'l BioMed. 
Res. Fd'n, Washington D.C. 1979.) 

The currently most preferred protein sequences 
detectable as morphogens in this invention include 
those having greater than 60% identity, preferably 
greater than 65% identity, with the amino acid sequence 
defining the conserved six cysteine skeleton of hOPl 
(e.g., residues 43-139 of Seq. ID No. 5). These most 
preferred sequences include both allelic and species 
variants of the OP-1 and OP-2 proteins, including the 
Drosophila 6 OA protein. Accordingly, in still another 
preferred aspect, the invention includes detection of 
morphogens comprising species of polypeptide chains 
having the generic amino acid sequence referred to 
herein as "OPX", which defines the seven cysteine 
skeleton and accommodates the identities between the 
Various identified mouse and human OP1 and OP 2 
proteins. OPX is presented in Seq. ID No. 29. As 
described therein, each Xaa at a given position 
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ind p ndently is sel cted from the residues ccurring 
at the corresp riding position in the C-terminal 
sequence of mouse or human OP1 or 0P2 (see Seg. ID 
Nos. 5-8 and/or Seq. ID Nos. 16-23). 

3. Tissue-Specific Expression of OP-1 

Once a morphogen is identified in a tissue, its 
level may be determined either at the protein or nucleic 
acid level. By comparing the levels of production of a 
given morphogen among different tissues, it is possible to 
determine the tissue(s) of origin of that morphogen. The 
level of production of the morphogen OP-1 in different 
tissues is one example of a morphogen having a tissue of 
origin, i.e., the kidney, which contains a cell type that 
can also be used as the cell type which is used to screen, 
according to the invention, different compounds for their 
potential effects on morphogen (OP-1) production. 

The level of OP-1 varies among different tissue 
types. In order to screen compounds for their effect on 
the production of OP-1 by a given cell type, it may be 
desirable to determine which tissues produce levels of OP-1 
which are sufficiently high to show a potential decrease 
and sufficiently low to show a potential increase in 
production. Different tissues may be screened at the RNft 
level as follows. 

Any probe capable of hybridizing specifically to a 
transcript, and distinguishing the transcript of interest 
from other, related transcripts may be used. Because the 
morphogens to be detected in the methods of this invention 
bhare such high sequence homology in their C-terminal 
domain, the tissue distribution of a specific morphogen 
transcript may best be determined using a probe specific 
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for the "pro" region of the immature protein and/or the 
N-t rminal heter gene us r gion of the mature pr tein. 
Another useful probe sequence is the 3 'non-coding region 
immediately following the stop codon. These portions of 
the sequence vary substantially among the morphogens of 
this invention, and accordingly, are specific for each 
protein. For example, a particularly useful Vgr-l-specif ic 
probe sequence is the PvuII-SacI fragment, a 265 bp 
fragment encoding both a portion of the pro region and the 
N- terminus of the mature sequence. Similarly, particularly 
useful mOP-1- specific probe sequences are the BstXI-Bgll 
fragment, a 0.68kb sequence that covers approximately two- 
thirds of the mOPl pro region? a StuI-StuI fragment, a 0.2 
kb sequence immediately upstream of the 7 -cysteine domain, 
and an Earl-PstI fragment, a 0.3kb fragment containing the 
3 'untranslated sequence. Similar approaches may be used, 
for example, with hOP-1 (SEQ. ID NO. 16) or human or mouse 
OP-2 (SEQ. ID NOS.20 and 22). 

Using morphogen- specific oligonucleotides probes, 
morphogen transcripts can be identified in mammalian 
tissues, using standard methodologies well known to those 
having ordinary skill in the art. Briefly, total UNA from 
mouse embryos and organs from post-natal animals is 
prepared using the acid guanidine thiocyanate-phenol- 
chloroform method (Chomczynski et al., Anal. Biochem. 
162:156-159, 1987). The RNA may be dissolved in TES buffer 
(10 mM Tris-HCl, 1 mM EDTA, 0.1% SDS, pH 7.5) and treated 
with Proteinase K (approx. 1.5 mg per g tissue sample) at 
45°c for 1 hr. Poly(A)* RNA selection on oligo(dT)- 
eellulose (Type 7, Pharmacia LKB Biotechnology Inc., 
Piscataway, NJ) may be done in a batch procedure by mixing 
0.1 g oligo(dT)-celiulose with 11 ml RNA solution (from 1 g 
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tissue) in TES buffer and 0.5 M NaCl). Th r after th 
oligo(dT) cellulose is washed in binding buffer (0.5 M 
NaCl, 10 mM Tris-HCl, 1 mM EDTA, pH 7.5) and poly(A) + RNA 
is eluted with water. Poly(A) + RNA (5 or 15 pg/lane) is 
fractionated on 1 or 1.2% agarose-formaldehyde gels 
(Selden, in Current Protocols in Molecular Biology, Ausubel 
et al. eds., pp. 1-4, B, 9, Greene Publishing and 
Wiley-Interscience, New York, 1991) . 1 fil of 400 pg/ml 
ethidium bromide is added to each sample prior to heat 
denaturation (Rosen et al.. Focus 12:23-24, 1990). 
Following electrophoresis, the gels are photographed and 
the RNA is blotted overnight onto Nytran nitrocellulose 
membranes (Schleicher & Schuell Inc., Keene, NH) with 10 x 
SSC. The membranes are baked at 80°C for 30-60 min. and 
irradiated with UV light (1 mW/cra 2 for 25 sec). The 
Northern hybridization conditions may be as previously 
described (Ozkaynak et al., EMBO J. 9s2085-2093, 1990). 
For re-use, the filters may be deprobed in 1 mH Tris-HCl, 
1 mM EDTA, 0.1% SDS, pH 7.5, at 90-95°C and exposed to film 
to assure complete removal of previous hybridization 
signals . 

One probe which may be used to screen for 
transcripts encoding a morphogen includes a portion of or 
the complete OP-1 cDNA, which may be used to detect the 
presence of OP-1 mRNA or mRNAs of related morphogens. The 
sequence of the murine cDNA gene is set forth in SEQ ID 
N0:14. 

OP-1 mRNA expression was analyzed in 17 day mouse 
embryos and 3 day post-natal mice by sequentially 
hybridizing filters with various probes. Probes from 
regions other than the highly conserved 7-cysteine domain 
were selected because this region is highly variable among 
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members f -the TGF-0 super family . Fig. 1 shows, the 
fragments of OP-1, us d as prob s in the Northern 
hybridizations. The solid box indicates the putative 
signal peptide and the hatched box corresponds to the 
TGF-p-like domain that contains the seven cysteine 
residues. Asterisks indicate the potential N-glycosylation 
sites. The arrow marks the location of the cleavage site 
for OP-1 maturation. Three solid bars below the diagram 
indicate the OP-1 specific fragments used in making 
32 P-labeled probes (0.68 kb BstXI - Bgll fragment, 0.20 kb 
StuI - StuI fragment and 0.34 kb Earl - PstI non-coding 
fragment) • 

Hybridization with a probe that covers approximately 
two thirds of the pro region (the 0.68 kb BstXI-Bgll 
fragment), reveals a 4 kb message and 3 messages at 1.8 kb, 
2.2 kb and 2.4 kb (Fig. 2B and D, and Fig. 3). In the 
Northern hybridization of Fig. 2, equal amounts (15 /jg) of 
poly (A) * RNA were loaded into each lane, electrophoresed on 
a 1% agarose-formaldehyde gel, blotted and hybridized* A 
0.24 - 9.49 kb RNA ladder (Bethesda Research Labs, Inc.) 
was used as size standard. The same filter was used for 
sequential hybridizations with labeled probes specific for 
OP-1 (Panels B and D), Vgr-1 (Panel C), and EF-Tu 
(Panel A). Panel A: the EF-Tu specific probe (a control) 
was the 0.4 kb Hindlll-SacI fragment (part of the coding 
region), the SacI site used belonged to the vector; Panel 
Bt the OP-1 specific probe was the 0.68 kb BstXI-Bgll 
fragment (two thirds of the pro region and upstream 
sequences of the mature domain, not including any sequences 
from the 7 -cysteine domain); Panel C: the Vgr-1 specific 
probe was the 0.26 kb Pvuil-SacI fragment (part of the pro 
region and the amino-terminal sequences of the mature 
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domain, including the first cysteine) (Ly ns t al., 1989, 
Proc. Nat. Aca. Sci. 86: 4554, hereby incorporated by 
reference). Panel D: the OP-1 (3' flanking) specific 
probe was the 0.34 kb Earl-Pstl fragment (3 r untranslated 
sequences immediately following the sequences encoding OP- 

1). 

In Fig. 3, the tissues to be used for RNA 
preparation were obtained from two week old mice (Panel A) 
or 5 week old mice (Panel B), with the exception of poly A+ 
RNA which was obtained from kidney adrenal gland of two 
week old mice (Panel B). Equal amounts of poly A+ RNA 
(15 pg for Panel A and 5 /jg for Panel B) were loaded into 
each well. After electrophoresis (1.2% agarose-' 
formaldehyde gels) and blotting, RNA was hybridized to the 
OP-1 specific 3' flanking probe described in the legend of 
Fig. 2 (Panel D). The 0.24-9.5 kb RNA ladder was used as 
size standard. The arrowheads indicate the 0P~1 specific 
messages. The lower section of Panels A and B show the 
hybridization pattern obtained with the EF-Tu specific 
probe (a control) . 

Although the size of the Vgr-1 specific message is 
close to the 4 kb OP-1 species (Fig. 2 Panel C), the OP-1 
4 kb mRNA is somewhat larger. To further rule out cross- 
hybridization with a non-OP-1 message, the 0.2 kb StuI-StuI 
fragment which represents the gene specific sequences 
immediately upstream of those encoding the 7-cysteine 
domain was used. This probe gave a hybridization pattern 
similar to the one shown in Fig. 2 Panel B (data not 
shown). A third probe, the 0.34 kb Earl-Pstl fragment 
containing 3' untranslated sequences, also confirmed the 
pattern (Fig. 2 Panel D). Thus, the same four OP-1 
specific messages were observed with three distinct probes. 
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The appearance of a new 4 kb OP-1 mRNA species was 
initially interpreted as cross hybridization of the OP-1 
probe with Vgr-1 mRNA, which is approximately this size 
(Fig. 2 Panel C). However, the 4 kb message was detected 
with three different OP-1 specific probes, including one 
specific to the 3' untranslated region, and moreover it was 
separated from Vgr-1 message on the basis of size* Most 
likely, therefore, the 4 kb mRNA (and the three species of 
1.8 kb, 2.2 kb and 2.4 kb) results from alternative 
splicing of OP-1 transcripts ♦ The 4 kb OP-1 mRNA could 
also represent a bicistronic mRNA. The. 4 kb message is a 
minor species in kidney, while it is more prominent in 
adrenal tissue. 

The level of OP-1 expression was compared in 
different tissues using poly (A) * RNA prepared from brain, 
spleen, lung, kidney and adrenal gland, heart, and liver of 
13 day post-natal mice. The RNA was analyzed on Northern 
blots by hybridization to various probes (Fig. 3. Equal 
amounts of mRNA, as judged by optical density, were 
fractionated on agarose formaldehyde gels. Ethidium 
bromide staining of the gels revealed some residual 
ribosomal RNA in addition to the mRNA and provided another 
assurance that the mRNA was not degraded and that there was 
not significant quantitative or qualitative variation in 
the preparation. As control for mRNA recovery, EF-Tu 
(translational elongation factor) mRNA was probed (assuming 
uniform expression of EF-Tu in most tissues). A great 
variation in the level of OP-1 expression was observed in 
.spleen, lung, kidney and adrenal tissues whereas EF-Tu mRNA 
levels appeared relatively constant in these tissues 
(Fig. 3 Panel A). The highest level of OP-1 mRNA was found 
in the kidneys. Uniformly lower levels of EF-Tu mRNA were 
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found in brain, heart and liver (Jig. 3 Panel A) * 
Additional analysis of OP-1 mRNA showed the presence of 
significant amounts of OP-1 mRNA in the bladder (data not 
shown) . In summary/ next to kidney, bladder and adrenal 
tissue, bra^n tissue contained the highest levels of OP-1 
UNA, whereas heart and liver did not give detectable 
signals* 

OP-1 mRNA patterns display qualitative changes in 
the various tissues. Of the four messages found in brain, 
the 2.2 Kb message is most abundant whereas in lung and 
spleen the 1*8 kb message predominates. Levels of the 
1.8-2.4 kb in the kidney OP-1 mRNA are approximately two 
times higher in 3 day post-natal mice than in 17 day 
embryos, perhaps reflecting phases in bone and/or kidney 
development. mRNA was also prepared from carefully 
separated renal and adrenal tissues of 5 week old mice. 
Northern blot analysis (Figure 4, Panel B) revealed that 
the high levels of 2.2 kb mRNA were derived from renal 
tissue whereas the 4 kb mRNA was more prominent in adrenal 
tissue. 

The detection of of OP-1 message primarily in the 
kidney but also in bladder links OP-1 expression 
specifically with the urinary tract. Interestingly, the 
related Vgr-1 is also expressed at significant levels in 
kidney although its main site of expression in lung. 

Once the tissue-specific expression of a given 
morphogen is known, cell types known to exist in that 
tissue or cell lines derived from that tissue can be 
screened, in a similar maimer, to identify the cell type 
within that tissue that is actually responsible for the 
tissue specific synthesis and secretion of the morphogen. 
Once a cell type which produces the morphogen in an amount 
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sufficient 1 ^ detect increases r deer ases in th 
production level of the morphogen upon exp sure to a 
compound is identified, it may be used in tissue cultures 
assay to rapidly screen for the ability of compound to 
upregulate or down regulate the synthesis and secretion of 
the morphogen. The level of morphogen production by the 
chosen cell type is determined .with and without incubating 
the cell in culture with the compound, in order to assess 
the effects of the compound on the cell's ability to 
synthesize or secrete the morphogen. This can be 
accomplished by detection of the level of production of the 
morphogen either at the protein or mRNA level. 

4. Growth of Cells in Culture 

Cell cultures derived from kidney, adrenals, urinary 
bladder, brain, or other organs, may be prepared as 
described widely in the literature. For example, kidneys 
may be explanted from neonatal, new bom, young or adult 
rodents (mouse or rat) and used in organ culture as whole 
or sliced (1-4 mm) tissues. Primary tissue cultures and 
established cell lines, also derived from kidney, adrenals, 
urinary, bladder, brain, or other tissues may be 
established in multiwell plates (6 well, 24 well, or 96 
well) according to conventional cell culture techniques, 
and are cultured in the absence or presence of serum for a 
period of time (1-7 days). Cells may be cultured,* for 
example, in Dulbecco's Modified Eagle medium (Gibco, Long 
Island, NY) containing serum (e.g., fetal calf serum at 1%- 
10%, Gibco) or in serum-deprived medium, as desired, or in 
defined medium (e.g., containing insulin, transferrin, 
glucose, albumin, or other growth factors)* 
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Samples f r testing the level of morphogen 
production include culture supernatants or cell ly sates, 
collected periodically and evaluated for OP-1 production by 
imraunoblot analysis of a portion of the cell culture 
itself r collected periodically and used to prepare polyA+ 
RNA for RNA analysis (Sambrook et al., eds., Molecular 
Cloning, 198 9 , Cold Spring Harbor Press , Cold Spring 
Harbor , NY)* To monitor de novo OP-1 synthesis, some 
cultures are labeled with 3 5 S-methionine/ 3 5 S-cysteine 
mixture for 6-24 hours and then evaluated for morphogen 
production by conventional immunoprecipitation methods 
(Sambrook et al., eds-, Molecular Cloning, 1989, Cold 
Spring Harbor Press, Cold Spring Harbor, NY} • 
Alternatively, the production of morphogen or determination 
of the level of morphogen production may be ascertained 
using a simple assay for a parameter of cell growth, e.g., 
cellular proliferation or death* For example, where a 
morphogen is produced by a cultured cell line, the addition 
of antibody specific for the morphogen may result in relief 
from morphogen inhibition of cell growth. Thus, 
measurement of cellular proliferation can be used as an 
indication of morphogen production by a tissue. 

5. Determination of Level of Morphogenic Protein 
In order to quantitate the production of a 
morphogenic protein by a cell type, an immunoassay may be 
performed to detect the morphogen using a polyclonal or 
monoclonal antibody specific for that morphogen* For 
example, OP-1 may be detected using a polyclonal antibody 
specific foir OP-1 in an ELISA, as follows* 

1 f/g/100 ul of affinity-purified polyclonal rabbit 
IgG specific for OP-1 is added to each well of a 96-well 
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plate and incubat d at 37°C for an hour. The w lis are 
washed four times with 0.16M sodium borate buffer with 0.15 
M NaCl (BSB), pH 8.2, containing 0.1% Tween 20. To 
minimize non-specific binding, the wells are blocked by 
filling completely with 1% bovine serum albumin (BSA) in 
BSB for 1 hour at 37 °C. The wells are then washed four 
times with BSB containing 0*1% Tween 20. A 100 ul aliquot 
of an appropriate dilution of each of the test samples of 
cell culture supernatant is added to each well in 
triplicate and incubated at 37 °C for 30 min* After 
incubation, 100 ul biotinylated rabbit anti-OP-1 serum 
(stock solution is about 1 mg/ml and diluted 1:400 in BSB 
containing 1% BSA before use) is added to each well and 
incubated at 37°C for 30 min. The wells are then washed 
four times with BSB containing 0.1% Tween 20. 100 ul 
strepavidin-alkaline (Southern Biotechnology Associates, 
Inc. Birmingham, Alabama, diluted 1:2000 in BSB containing 
0.1% Tween 20 before use) is added to each well and 
incubated at 37 °C for 30 min. The plates are washed four 
times with 0.5M Tris buffered Saline (TBS), pH 7.2. 50ul 
substrate (ELISA Amplification System Kit, Life 
Technologies, Inc., Bethesda, MD) are added to each well 
incubated at room temperature for 15 min. Then, 50 ul 
amplifier (from the same amplification system kit) is added 
and incubated for another 15 min at room temperature. The 
reaction is ^stopped by the addition of 50 ul 0.3 M 
sulphuric acid. The OD at 490 nm of the solution in each 
well is recorded. To guantitate OP-1 in culture media, a 
pP-1 standard curve is performed in parallel with the test 
samples • 
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6. Pr paration f P Ivcl nal Antibody 

Polyclonal antibody is prepared as follows * Bach 
rabbit is given a primary immunization of 100 ug/500 ul 
B. coli- prodttced OP-1 monomer (amino acids 328-431 of SEQ. 
ID NO: 11) in 0.11 SDS mixed with 500 ul Complete Freund's 
Adjuvant* The antigen is injected subcutaneous ly at 
multiple sites on the back and flanks of the animal. The 
rabbit is boosted after a month in the same manner using 
incomplete Freund's Adjuvant. Test bleeds are taken from 
the ear vein seven days later. Two additional boosts and 
test bleeds are performed at monthly intervals until 
antibody against OP-1 is detected in the serum using an 
ELISA assay* Then, the rabbit is boosted monthly with 100 
ug of antigen and bled (15 ml per bleed) at days seven and 
ten after boosting. 

7. Preparation of Monoclonal Antibody and Neutralizing Mon 

oclonal Antibody 

Monoclonal antibody specific for a given morphogen 
may be prepared as follows. A mouse is given two 
injections of E. coli produced OP-1 monomer (amino acids 
328-431 in SEQ ID NO:ll>. The first injection, contains 
lOOug of OP-1 in complete Freund's adjuvant and is given 
subcutaneously. The second injection contains 50 ug of OP- 
1 in incomplete adjuvant and is given intraperitoneally. 
The mouse then receives a total of 230 ug of OP-1 (amino 
acids 307-431 of SEQ ID NO: 11) in four intraperitoneal 
injections at various times over an eight month period. 
One week prior to fusion. The mouse is boosted 
intraperitoneally with 100 ug of OP-1 (15-139) and 30 ug of 
the N-terminal peptide (Ser293-Asn309-Cys) conjugated 
through the added cys residue to bovine serum albumin with 
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SHCC cr sslinklng agent. This boost is repeated five days 
(IP), four days (IP)., three days (IP) and one day (IV) 
prior to fusion. The mouse spleen cells are then fused to 
myeloma (e.g., 653) cells at a ratio of 1:1 using PEG 15Q0 
(Boehringer Mannheim), and the cell fusion is plated and 
screened for OP-l-specific antibodies using OP-1 ( 307-43 1) 
as antigen. The cell fusion and monoclonal screening are 
according to procedures widely available in the art. The 
neutralizing monoclonal is identified by its ability to 
block the biological activity of OP-1 when added to a 
cellular assay which responds biologically to added OP-1. 

8. Identification of OP-1 Producing Cell Line Which 
Displays OP-1 Surface Receptors 

During the process of routinely testing the effects 
of increasing concentrations of OP-1 and TGF-B on the 
proliferation of various cell lines, a cell line was 
identified which, surprising, appears not only to 
synthesize and secrete OP-1, but also to display cell 
surface receptors to which the secreted OP-1 binds and acts 
to inhibit proliferation of the cells. This cell line was 
identified after the following observations. Addition of 
increasing concentrations of OP-1 or TGF-B failed to 
increase or decrease the relatively low basal rate of 
proliferation of the cells. However, addition of a 
monoclonal antibody, which neutralizes the activity of 
Op-1, resulted in a large increase in the proliferation 
of the cells. In addition, simultaneous addition of 
the same quantity of OP-1 neutralizing monoclonal to a 
fixed amount of OP-1 resulted in an increase in 
proliferation which was intermediate between the low 
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basal level observ d with OP-1 alone and th high 1 v 1 
observed with the monoclonal alone. This cell line, 
which is an epithelial cell line that was derived from a 
bladder cell carcinoma, may be used in an assay of the 
invention. The parameter to be tested according to the 
invention is cellular proliferation. Thus, a compound! s) 
that increases or decreases the level of OP-1 production 
may be tested on this cell line as follows.. 
9. Assay for Identifying Drugs Which Affect OP-1 Synthesi 

A simple medium flux screening assay can be configured 
in a standard 24 or 96 well microtiter dishe, in which each 
well contains a constant number of a cell line having the 
characteristics described above. Increasing concentrations 
of an OP-I neutralizing monoclonal antibody is added from 
left to right across the dish. A constant amount of 
different test substances is added from top to bottom on 
the dish. An increase in the synthesis and secretion of 
OP-1 (over its constitutive (non-induced) level) will be 
indicated by an increase in the amount of OP-1 neutralizing 
antibody required to release the cells from the 
antimitogenic activity of OP-1. A decrease in the 
synthesis and secretion of OP-1 (below its constitutive 
(repressed) level) will be indicated by the observation 
that decreased concentrations of the OP-1 neutralizing 
monoclonal antibody will be required to release the cells 
from the antimitogenic activity of OP-1. One of the major 
advantages of this assay is that the end point, i.e., the 
•dilution of .antibody which has an effect on cell 
proliferation, is a measure of mitosis, or an increase in 
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the number of cells per well. Because several c nvenlent 
and rapid assays exist for quantitating cell numbers, this 
assay is faster and requires significantly fewer steps to 
perform. 

The assay may be performed as follows. After addition 
of appropriate concentrations of the OP-1 neutralizing 
monoclonal antibody and test substances to the wells 
containing the cells, the dishes are placed in an incubator 
at 37° C for a period of 1-3 days. After completion of 
incubation/growth period, the dishes are removed and the 
cells in the individual wells are washed and stained with a 
vital stain, such as crystal violet. Washing and staining 
procedures are well-known in the art. The cells are then 
lysed and the stain dissolved in a constant amount of a 
solvent, such as ethanol. Quantitations of the dissolved 
stain, which is readily performed on an automated plate 
vendor, allows for direct quantitation of the number of 
cells in each well. 

The above-described assay has the advantages of being 
rapid and easy to perform becaue it requires few steps. 
Another advantage is intrinsic to the assay; drugs which 
are screened according to this procedure that result in 
cell death (i.e., cytotoxic substances) are immediately, 
identifiable without the need of operator observation. In 
addition, although drugs that stop the growth of the cells 
(i.e., cytostatic substances) are scored as positive due to 
failure to see increases in cell numbers, they are 
automatically scored as suspect due to the failure of the 
highest concentrations of OP-1 neutralizing monoclonal 
antibody to release the cells from the antimitogenic 
activity of OP-1. 
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10. Candidate Drugs to Scr en 

The screening methods of the invention is used to test 
compounds for their effect on the production of morphogenic 
protein by a given cell type. Examples of compounds which 
may be screened include but are not limited to chemicals, 
biological response modifiers {e.g*, lymphokines, 
cytokines, hormones r or vitamins ), plant extracts, 
microbial broths and extracts medium conditioned by 
eukaryotic cells, body fluids, or tissue extracts. 

The invention may be embodied in other specific forms 
without departing from the spirit or essential 
characteristics thereof. The present embodiments are 
therefore to be considered in all respects as illustrative 
and not restrictive, the scope of the invention being 
indicated by the appended claims rather than by the 
foregoing description, and all changes which come within 
the meaning and range of equivalency of the claims are 
therefore intended to be embraced therein. 



WO 93/05172 



67 



PCT/US92/07359 



SEQUENCE LISTING 
(1) GENERAL INFORMATION: 

(i) APPLICANT: John Smart 

Herman. Oppennann 
Engin Ozkaynak 
Thangavel Kuberasampath 
David C. Rueger 
Roy H.L. Pang 
Charles M. Cohen 

(ii) TITLE OF INVENTION: MORPHOGENIC 
PROTEIN SCREENING METHOD 

(iii) NUMBER OF SEQUENCES: 33 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Creative BioMolecules 

(B) STREET: 35 South Street 

(C) CITY: Hopkinton 

(D) STATE: Massachusetts 

(E) COUNTRY: U.S.A. 

(F) ZIP: 01748 

(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette, 5.25, 

360kb storage 

(B) COMPUTER: IBM XT 

(C) OPERATING SYSTEM: DOS 3.30 

(D) SOFTWARE: ASC II TEXT 

(vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 667,274 

(B) FILING DATE: March 11, 1991 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 752,861 

(B) FILING DATE: AUGUST 30, 1991 
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(viii) ATTORNEY/AGENT INFORMATION 

(A) NAME: PITCHER, EDMUND R. 

(B) REG. NO. : 27,829 ^ 

(C) DOCKET NO.: CRP-OSS^C 

(ix) TELEPHONE: 

(A) 617/248-7000 

(B) TELEFAX: 617/248-7100 
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(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 97 amino acids 

(B) TYPE: amino acids 

(C) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(ix) FEATURE: 

(A) NAME: Generic Sequence 1 

(D) OTHER INFORMATION: Each Xaa 
indicates one. of the 20 naturally- 
occurring L-isomer, a-amino acids 
or a derivative thereof. 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

Xaa Xaa Xaa Xaa Xaa Xaa 
1 5 
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

10 15 
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Cys Xaa Xaa Xaa 

20 25 
Cys Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

30 35 
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
40 45 50 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Cys 

55 60 
Cys Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

65 70 
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

75 80 
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Cys 

85 90 
Xaa Cys Xaa 
95 
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(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 97 amino acids 

(B) TYPE: amino acids 

(C) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(ix) FEATURE: 

(A) NAME: Generic Sequence 2 

(D) OTHER INFORMATION: Bach Xaa 
indicates one of the 20 naturally- 
occurring L-isomer, a-amino acids 
or a derivative thereof. 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Xaa Xaa Xaa Xaa Xaa Xaa 
1 5 
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

10 15 
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Cys Xaa xaa Xaa 

20 25 
Cys Xaa Xaa Xaa Xaa Xaa Xaa Cys Xaa Xaa Xaa 

30 35 
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa xaa 
40 45 50 

Xaa Xaa Xaa Xaa xaa Xaa Xaa Xaa Xaa Xaa Cys 

55 60 
Cys Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

65 70 
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

• 75 80 
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Cys 

85 90 
Xaa Cys Xaa 
95 
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(2) INFORMATION FOR SBQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 97 amino acids 

(B) TYPE: amino acids 

(C) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(ix) FEATURE: 

(A) NAME: Generic Sequence 3 

(D) OTHER INFORMATION: wherein each 
Xaa is independently selected from 
a group of one or more specified 
amino acids as defined in the 
specification. 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Leu Tyr Val Xaa Fhe 
1 5 
Xaa xaa Xaa Gly Trp Xaa Xaa Trp Xaa 

10 

Xaa Ala Pro Gly Xaa Xaa Xaa Ala 

15 20 
Xaa. Tyr Cys Xaa Gly Xaa Cys Xaa 
25 30 
Xaa Pro Xaa Xaa Xaa Xaa Xaa 

35 

Xaa Xaa Xaa Asn His Ala Xaa Xaa 
40 45 

Xaa Xaa Leu Xaa Xaa Xaa Xaa Xaa 

50 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Cys 

55 60 
Cys Xaa Pro Xaa Xaa Xaa Xaa Xaa 
65 
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Xaa Xaa Xaa L u Xaa Xaa Xaa 

70 75 
Xaa Xaa xaa Xaa Val Xaa Leu Xaa 
80 

Xaa Xaa Xaa Xaa Met Xaa Val Xaa 

85 90 
Xaa Cys Gly Cys Xaa 
95 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 amino acids 

(B) TYPE: amino acids 

(C) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(iz) FEATURE: 

(A) NAME: Generic Sequence 4 

(D) OTHER INFORMATION: wherein each 
Xaa is independently selected from 
a group of one or more specified 
amino acids as defined in the 
specification » 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Cys Xaa Xaa Xaa Xaa Leu Tyr Val Xaa Phe 
1 5 10 

Xaa Xaa Xaa Gly Trp Xaa Xaa Trp Xaa 

15 

Xaa Ala Pro Xaa Gly Xaa Xaa Ala 

20 25 
Xaa Tyr Cys Xaa Gly Xaa Cys Xaa 
30 35 
Xaa Pro Xaa Xaa Xaa Xaa Xaa 

40 
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Asn Xaa Xaa Asn His Ala Xaa Xaa 
45 50 
Xaa Xaa Leu Xaa Xaa Xaa Xaa Xaa 

55 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Cys 

60 65 
Cys Xaa Pro Xaa Xaa Xaa Xaa Xaa 
70 

Xaa Xaa Xaa Leu Xaa Xaa Xaa 

75 80 
Xaa Xaa Xaa Xaa Val Xaa Leu Xaa 
85 

Xaa Xaa Xaa Xaa Met Xaa Val Xaa 

90 95 
Xaa Cys Gly Cys Xaa 
100 



(2) INFORMATION FOR SBQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 139 amino acids 

(B) TYPE: amino acids 

(C) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(ix) FEATURE: 

(A) NAME: hOP-1 (mature form) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5 J 

Ser Thr Gly Ser Lys Gin Arg Ser Gin 

1 5 
Asn Arg Ser Lys Thr Pro Lys Asn Gin 

10 15 
Glu Ala Leu Arg Met Ala Asn Val Ala 

20 25 
Glu Asn Ser Ser Ser Asp Gin Arg Gin 
30 35 
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Ala Cys Lys Lys His 
40 

Ser Phe Arg Asp Leu 

50 

Trp He He Ala Pro 
55 

Ala Tyr Tyr Cys Glu 
65 

Phe Pro Leu. Asn Ser 
75 

Thr Asn His Ala lie 
85 

Val His Phe He Asn 

95 

Pro Lys Pro Cys Cys 
100 

Leu Asn Ala lie Ser 
110 

Asp Asp Ser Ser Asn 
120 

Lys Tyr Arg Asn Met 
130 

Cys Gly Cys His 
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Glu Leu Tyr Val 
45 

Gly Trp Gin Asp 

Glu Gly Tyr Ala 
60 

Gly Glu Cys Ala 
70 

Tyr Met Asn Ala 
80 

Val Gin Thr Leu 
90 

Pro Glu Thr Val 

Ala Pro Thr Gin 
105 

Val Leu Tyr Phe 
115 

Val He Leu Lys 
125 

Val Val Arg Ala 
135 
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INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 139 amino acids 

(B) TYPE: amino acids 

(C) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(ixj FEATURE: 

(A) NAME: mOP-1 (mature form) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 



Ser Thr Gly Gly Lys 
1 5 
Asn Arg Ser Lys Thr 
10 

Glu Ala Leu Arg Met 
20 

Glu Asn Ser Ser Ser 
30 

Ala Cys Lys Lys His 
40 

Ser Phe Arg Asp Leu 

50 

Trp lie lie Ala Pro 
55 

Ala Tyr Tyr Cys Glu 
65 

Phe Pro Leu Asn Ser 
75 

Thr Asn His Ala He 
85 

Val His Phe He Asn 

95 

Pro Lys Pro Cys Cys 
100 

Leu Asn Ala He Ser 
110 



Gin Arg Ser Gin 

Pro Lys Asn Gin 
15 

Ala Ser Val Ala 
25 

Asp Gin Arg Gin 
35 

Glu Leu Tyr Val 
45 

Gly Trp Gin Asp 

Glu Gly Tyr Ala 
60 

Gly Glu Cys Ala 
70 

Tyr Met Asn Ala 
80 

Val Gin Thr Leu 
90 

Pro Asp Thr Val 

Ala Pro Thr Gin 
105 

Val Leu Tyr Phe 
115 
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Asp Asp Ser Ser Asn Val lie Leu Lys 

120 125 
Lys Tyr Arg Asn Met Val Val Arg Ala 
130 135 
Cys Gly Cys His 

(2) INFORMATION FOR SEQ ID NO: 7 I 

(i) SEQUENCE CHARACTERISTICS: 

(A) , LENGTH: 139 amino acids 

(B) TYPE: amino acids 

(C) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(ix) FEATURE: 

(A) NAME: hOP-2 (mature form) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7s 

Ala Val Arg Pro Leu Arg Arg Arg Gin 

1 5 
Pro Lys Lys Ser Asn Glu Leu Pro Gin 

10 15 

Ala Asn Arg Leu Pro Gly lie Phe Asp 

20 25 

Asp Val His Gly Ser His Gly Arg Gin 

30 35 

Val Cys Arg Arg His Glu Leu Tyr Val 

40 4 5 

Ser Phe Gin Asp Leu Gly Trp Leu Asp 

50 

Trp Val lie Ala Pro Gin Gly Tyr Ser 

55 60 
Ala Tyr Tyr Cys Glu Gly Glu Cys Ser 

65 70 
Phe Pro Leu Asp Ser Cys Met Asn Ala 

75 80 
Thr Asn His Ala lie Leu Gin Ser Leu 
85 90 



WO 93/05172 



PCT/US92/07359 



77 



Val His Leu Met Lys Pro Asn Ala Val 

95 

Pro Lys Ala Cys Cys Ala Pro Thr Lys 

100 105 

Leu Ser Ala Thr Ser Val Leu Tyr Tyr 

110 115 
Asp Ser Ser Asn Asn Val He Leu Arg 

120 125 
Lys His Arg Asn Met Val Val Lys Ala 
130 135 
Cys Gly Cys .His 

(2) INFORMATION FOR SEQ ID NO: 8: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 139 amino acids 

(B) TYPE: amino acids 

(C) TOPOLOGY: linear 
(ii) MOLECULE TYPE: protein 
(ix) FEATURE: 

(A) NAME: mOP-2 (mature form) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 



Ala Ala Arg Pro 
1 

Pro Lys Lys Thr 
10 

Pro Asn Lys Leu 
20 

Asp Gly His Gly 
30 

Val Cys Arg Arg 
40 

Ser Phe Arg Asp 

Trp val lie Ala 

55. 



Leu Lys Arg Arg Gin 
5 

Asn Glu Leu Pro His 
15 

Pro Gly He Phe Asp 
25 

Ser Arg Gly Arg Glu 
35 

His Glu Leu Tyr Val 

45 

Leu Gly Trp Leu Asp 
50 

Pro Gin Gly Tyr Ser 
60 
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Ala Tyr Tyr Cys Glu 
65 

Phe Pro Leu Asp Ser 
75 

Thr Asn His Ala He 
85 

Val His Leu Met Lys 

95 

Pro Lys Ala Cys Cys 
100 

Leu Ser Ala Thr Ser 
110 

Asp Ser Ser Asn Asn 
120 

Lys His Arg Asn Met. 

130 

Cys Gly Cys His 



Gly Glu Cys Ala 
70 

Cys Met Asn Ala 
80 

Leu Gin Ser Leu 
90 

Pro Asp Yal Val 

Ala Pro Thr Lys 
105 

Val Leu Tyr Tyr 
115 

Val He Leu Arg 
125 

Val Val Lys Ala 
135 
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(2) INFORMATION FOR SBQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LBNGTH: 96 amino acids 

(B) TYPE: amino acids 

(C) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(ix) FEATURE: 

(A) NAME: CBMP-2A(fx) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Cys Lys Arg His Pro Leu Tyr Val Asp Phe Ser 

15 10 
Asp Val Gly Trp Asn Asp Trp lie Val Ala Pro 

15 .. 20 

Pro Gly Tyr His Ala Phe Tyr Cys His Gly Glu 

25 30 
Cys Pro Phe Pro Leu Ala Asp His Leu Asn Ser 

35 40 
Thr Asn His Ala He Val Gin Thr Leu Val Asn 
45 50 55 

Ser Val Asn Ser Lys He Pro Lys Ala Cys Cys 

60 65 
Val Pro Thr Glu Leu Ser Ala He Ser Met Leu 

70 75 
Tyr Leu Asp Glu Asn Glu Lys Val Val Leu Lys 

80 85 
Asn Tyr Gin Asp Met Val Val Glu Gly Cys Gly 

90 95 
Cys Arg 
100 
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(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 L amino acids 

(B) TYPE: amino acids 

(C) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(ix) FEATURE: 

(A) NAME: CBMP-2B(fx) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:10r 

Cys Arg Arg His Ser 
1 5 
Leu Tyr Val Asp Phe Ser Asp Val Gly Trp Asn 

10 15 
Asp Trp lie Val Ala Pro Pro Gly Tyr Gin Ala 

20 25 
Phe Tyr Cys His Gly Asp Cys Pro Phe Pro Leu 

30 35 
Ala Asp His Leu Asn Ser Thr Asn His Ala lie 

40 45 
Val Gin Thr Leu Val Asn Ser Val Asn Ser Ser 
50 55 60 

lie Pro Lys Ala Cys Cys Val Pro Thr Glu Leu 

65 70 
Ser Ala He Ser Met Leu Tyr Leu Asp Glu Tyr 

75 80 
Asp Lys Val Val Leu Lys Asn Tyr Gin Glu Met 

85 90 
Val Val Glu Gly Cys Gly Cys Arg 
95 100 
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(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 amino acids 

(B) TYPE: amino acids 

(C) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(ix) FEATURE: 

(A) NAME: DPP(fx) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Cys Arg Arg His Ser Leu Tyr Val Asp Phe Ser 

1 5 10 

Asp Val Gly Trp Asp Asp Trp lie Val Ala Pro 

15 20 
Leu Gly Tyr Asp Ala Tyr Tyr Cys His Gly Lys 

25 30 
Cys Pro Phe Pro Leu Ala Asp His Phe Asn Ser 

35 40 
Thr Asn Bis Ala Val Val Gin Thr Leu Val Asn 
45 50 55 

Asn Asn Asn Pro Gly Lys Val Pro Lys Ala Cys 

60 65 
Cys Val Pro Thr Gin Leu Asp Ser Val Ala Met 

70 75 
Leu Tyr Leu Asn Asp Gin Ser Thr Val Val Leu 

80 85 
Lys Asn Tyr Gin Glu Met Thr Val Val Gly Cys 

90 95 
Gly Cys Arg 
100 
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(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 amino acids 

(B) TYPE: amino acids 

(C) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(ix) FEATURE: 

(A) NAME: Vgl(fx) 
(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 12: 

Cys Lys Lys Arg His Leu Tyr Val Glu Phe Lys 

1 5 10 

Asp Val Gly Trp Gin Asn Trp Val He Ala Pro 

15 20 
Gin Gly Tyr Met Ala Asn Tyr Cys Tyr Gly Glu 

25 30 
Cys Pro Tyr Pro Leu Thr Glu He Leu Asn Gly 

35 40 
Ser Asn His Ala He Leu Gin Thr Leu Val His 
45 50 55 

Ser He Glu Pro Glu ASp He Pro Leu Pro Cys 

60 65 
Cys Val Pro Thr Lys Met Ser Pro He Ser Met 

70 75 
Leu Phe Tyr Asp Asn Asn Asp Asn Val Val Leu 

80 85 
Arg His Tyr Glu Asn Met Ala Val Asp Glu Cys 

90 95 
Gly Cys Arg 
100 
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(2) INFORMATION FOR SBQ ID NOil3x 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH x 102 amino acids 

(B) TYPE: amino acids 

(C) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(ix) FEATURE: 

(A) NAME: Vgr-l(fx) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Cys Lys Lys His Glu Leu Tyr Val Ser Phe Gin 

15 10 
Asp Val Gly Trp Gin Asp Trp lie He Ala Pro 

15 20 
Xaa Gly Tyr Ala Ala Asn Tyr Cys Asp Gly Glu 

25 30 
Cys Ser Phe Pro Leu Asn Ala His Met Asn Ala 

35 40 
Thr Asn His Ala He Val Gin Thr Leu Val Bis 
45 50 . 55 

Val Met Asn Pro Glu Tyr Val Pro Lys Pro Cys 

60 65 
Cys Ala Pro Thr Lys Val Asn Ala He Ser Val 

70 75 
Leu Tyr Phe Asp Asp Asn Ser Asn Val He Leu 

80 85 
Lys Lys Tyr Arg Asn Met Val Val Arg Ala Cys 

90 95 
Gly Cys His 
100 
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(2) INFORMATION FOR SEQ ID NO: 14: 

(X) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 106 amino acids 

(B) TYPES protein 

(C) STRANDEDNESS: single 
(DJ TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(vi) ORIGINAL SOURCE: 
(A) ORGANISM* human 
(F) TISSUE TIPS: BRAIN 

(ix) FEATURE: 
(D) OTHER INFORMATION: 

/product- "GDF-I (frj" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: Ui 

Cys Are Ala Are Arg Leu Tyr Val Ser Phe Arg Glu Val Gly 
1 5 10 

Trp His Arg Trp Val lie Ala Pro Arg Gly Phe Leu Ala Asn Tyr 
15 20 25 

Cys Gin Gly Gin Cys Ala Leu Pro Val Ala Leu Ser Gly Ser Gly 
30 35 40 

Gly Pro Pro Ala Leu Asn His Ala Val Leu Arg Ala Leu Met His 
45 50 55 

Ala Ala Ala Pro Gly Ala Ala Asp Leu Pro Cys Cys Val Pro Ala 
60 65 70 

Are Leu Ser Pro lie Ser Val Leu Phe Phe Asp Asn Ser Asp Asn 
75 80 85 

Val Val Leu Arg Gin Tyr Glu Asp Met Val Val Asp Glu Cys Gly 
90 ' 95 100 

Cys Arg 
105 
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(2) INFORMATION FOR SEQ ID HO: 15: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH t 5 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION j SEQ ID NO: 15: 

Cys Xaa Xaa Xaa Xaa 
1 5 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1822 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: HOMO SAPIENS 
(F) TISSUE TYPE: HIPPOCAMPUS 

(ix) FEATURE: 

(A) NAME/KEY:. CDS 

(B) LOCATION: 49.. 1341 

(D) OTHER INFORMATION: /standard_name= "hOPl" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

GGTGCGGGCC CGGAGCCCGG AGCCCGGGTA GCGCGTAGAG CCGGCGCG ATG CAC GTG 57 

Met His Val 
1 

CGC TCA CTG CGA GCT GCG GCG CCG CAC AGC TTC GTG GCG CTC TGG GCA 105 
Arg Ser Leu Arg Ala Ala Ala Pro His Ser Phe Val Ala Leu Trp Ala 
. 5 10 15 

CCC CTG TTC CTG CTG CGC TCC GCC CTG GCC GAC TTC AGC CTG GAC AAC 153 
Pro Leu Phe Leu Leu Arg Ser Ala Leu Ala Asp Phe Ser Leu Asp Asn 
20 25 30 35 

GAG GTG CAC TCG AGC TTC ATC CAC CGG CGC CTC CGC AGC CAG GAG CGG 201 
Glu Val His Ser Ser Phe He His Arg Arg Leu Arg Ser Gin Glu Arg 
40 45 50 

CGG GAG ATG CAG CGC GAG ATC CTC TCC ATT TTG GGC TTG CCC CAC CGC 249 
Arg Glu Met Gin Arg Glu He Leu Ser He Leu Gly Leu Pro His Arg 
55 60 65 
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CCG CGC CCG CAC CTC CAG GGC AAG CAC AAC TCG GCA CCC ATG TIC ATG 
Pro Are Pro His Leu Gin Gly Lys His Asn Ser Ala Pro Met Pbe Het 
70 75 80 

CTG GAC GIG TAG AAC GCC ATG GCG GIG GAG GAG GGC GGC GGG CCC GGC 
Leu Asp Leu Tyr Asn Ala Het Ala Val Glu Glu Gly Gly Gly Pro Gly 
85 90 95 

GGC CAG GGC TTC TCC IAC CCC TAC AAG GCC GTC TTC AGT ACC CAG GGC 
Gly Gin Gly Phe Ser Tyr Pro Tyr Lys Ala Val Phe Ser Thr Gin Gly 
100 105 110 115 

CCC CCT CTG GCC AGC CTG CAA GAT AGC CAT TTC CTC ACC GAC GCC GAC 
Pro Pro Leu Ala Ser Leu Gin Asp Ser His Phe Leu Tbr Asp Ala Asp 
120 125 130 

ATG GTC ATG AGC TTC GTC AAC CTC GTG GAA CAT GAC AAG GAA TTC TTC 
Met Val Het Ser Phe Val Asn Leu Val Glu His Asp Lys Glu Pbe Phe 
135 140 145 

CAC CCA CGC TAC CAC CAT CGA GAG TTC CGG TTT GAT CTT TCC AAG ATC 
His Pro Arg Tyr His His Arg Glu Phe Arg Phe Asp Leu Ser Lys lie 
150. 155 160 

CCA GAA GGG GAA GCT GTC ACG GCA GCC GAA TTC CGG ATC TAC AAG GAC 
Pro Glu Gly Glu Ala Val Thr Ala Ala Glu Phe Arg lie Tyr Lys Asp 
165 170 175 

TAC ATC CGG GAA CGC TTC GAC AAT GAG ACG TTC CGG ATC AGC GIT TAT 
Tyr lie Arg Glu Arg Phe Asp Asn Glu Thr Phe Arg lie Ser Val Tyr 
180 185 190 195 

CAG GTG CTC CAG GAG CAC TTG GGC AGG GAA TCG GAT CTC TTC CTG CTC 
Gin Val Leu Gin Glu His Leu Gly Arg Glu Ser Asp Leu Phe Leu Leu 
200 205 210 

GAC AGC CGT ACC CTC TGG GCC TCG GAG GAG GGC TGG CTG GTG TTT GAC 
Asn Ser Arg Thr Leu Trp Ala Ser Glu Glu Gly Trp Leu Val Phe Asp 
215 220 225 

ATC ACA GCC ACC AGC AAC CAC TGG GTG GTC AAT CCG CGG CAC AAC CTG 
lie Thr Ala Thr Ser Asn His Trp Val Val Asn Pro Arg His Asn Leu 
230 235 240 

GGC CTG CAG CTC TCG GTG GAG ACG CTG GAT GGG CAG AGC ATC AAC CCC 
Gly Leu Gin Leu Ser Val Glu Thr Leu Asp Gly Gin Ser lie Asn Pro 
245 250 255 



AAG TTG GCG GGC CTG ATT GGG CGG CAC GGG CCC CAG AAC AAG CAG CCC 
Lys Leu Ala Gly Leu lie Gly Arg His Gly Pro Gin Asn Lys Gin Pro 
260 265 270 275 



297 



345 



393 



441 



489 



537 



585 



633 



681 



729 



777 



825 



873 
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TTC ATG GTG GCT TTC TTC AAG GCC ACG GAG GTC CAC TIC CGC AGC ATC 921 
Phe Met Val Ala Phe Phe Lys Ala Thr Glu Val His Phe Axg Ser lie 
280 285 290 

CGG TCC ACG GGG AGC AAA CAG CGC AGC CAG AAC CGC TCC AAG ACG CCC 969 
Arg Ser Thr Gly Ser Lys Gin Arg Ser Gin Asn Arg Ser Lys Thr Pro 
295 300 305 

AAG AAC CAG GAA GCC CTG CGG ATG GCC AAC GTG GCA GAG AAC AGC AGC 1017 
Lys Asn Gin Glu Ala Leu Arg Met Ala Asn Val Ala Glu Asn Ser Ser 
310 315 320 

AGC GAC CAG AGG CAG GCC TGT AAG AAG CAC GAG CTG TAT GTC AGC TTC 1065 
Ser Asp Gin Arg Gin Ala Cys Lys Lys Bis Glu Leu Tyr Val Ser Phe 
325 330 335 

CGA GAC CTG GGC TGG CAG GAC TGG ATC ATC GCG CCT GAA GGC TAC GCC 1113 
Arg Asp Leu Gly Trp Gin Asp Trp lie lie Ala Pro Glu Gly Tyr Ala 
340 345 350 355 

GCC TAC TAC TGT GAG GGG GAG TGT GCC TTC CCT CTG AAC TCC TAC ATG 1161 
Ala Tyr Tyr Cys Glu Gly Glu Cys Ala Phe Pro Leu Asn Ser Tyr Met 
360 365 370 

AAC GCC ACC AAC CAC GCC ATC GTG CAG ACG CTG GTC CAC TTC ATC AAC 1209 
Asn Ala Thr Asn His Ala lie Val Gin Thr Leu Val His Phe He Asn 
375 380 385 

CCG GAA ACG GTG CCC AAG CCC TGC TGT GCG CCC ACG CAG CTC AAT GCC 1257 
Pro Glu Thr Val Pro Lys Pro Cys Cys Ala Pro Thr Gin Leu Asn Ala 
390 395 400 

ATC TCC GTC CTC TAC TTC GAT GAC AGC TCC AAC GTC ATC CTG AAG AAA 1305 
He Ser Val Leu Tyr Phe Asp Asp Ser Ser Asn Val He Leu Lys Lys 
405 410 415 

TAC AGA AAC ATG GTG GTC CGG GCC TGT GGC TGC CAC TAGCTCCTCC 1351 
Tyr Arg Asn Met Val Val Arg Ala Cys Gly Cys His 
420 425 430 



GAGAATTCAG 


ACCCTTTGGG 


GCCAAGTTTT 


TCTGGATCCT CCATTGCTCG 


CCTTGGCCAG 


1411 


GAACCAGCAG 


ACCAACTGCC 


TTTTGTGAGA 


CCTTCCCCTC CCTATCCCCA 


ACTTTAAAGG 


1471 


TGTGAGAGTA 


TTAGGAAACA 


TGAGCAGCAT 


ATGGCTTTTG ATCAGTTTTT 


CAGTGGCAGC 


1531 


ATCCAATGAA 


CAAGATCCTA 


CAAGCTGTGC 


AGGCAAAACC TAGCAGGAAA 


AAAAAACAAC 


1591 


GCATAAAGAA 


AAATGGCCGG 


GCCAGGTCAT 


TGGCTGGGAA GTCTCAGCCA 


TGCACGGACT 


1651 


CGTTTCCAGA 


GGTAATTATG 


AGCGCCTACC 


AGCCAGGCCA CCCAGCCGTG 


GGAGGAAGGG 


1711 


GGCGTGGCAA 


GGGGTGGGCA 


CATTGGTGTC 


TGTGCGAAAG GAAAATTGAC 


CCGGAAGTTC 


1771 


CTGTAATAAA 


TGTCACAATA 


AAACGAATGA 


ATGAAAAAAA AAAAAAAAAA 


A 


1822 
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(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 431 amino acids . 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(il) HOLECULE TYPE: protein • 

fix) FEATURE: 

(D) OTHER INFORMATION: /Product="OPl-PP" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Met His Val Arg Ser Leu Arg Ala Ala Ala Pro His Ser Phe Val Ala 
15 10 15 

Leu Trp Ala Pro Leu Phe Leu Leu Arg Ser Ala Leu Ala Asp Phe Ser 
20 25 30 

Leu Asp Asn GIu Val His Ser Ser Phe lie His Arg Arg Leu Arg Ser 
35 40 45 

Gin Glu Arg Arg Glu Met Gin Arg Glu lie Leu Ser lie Leu Gly Leu 
50 55 60 

Pro His Are Pro Arg Pro His Leu Gin Gly Lys His Asn Ser Ala Pro 
65 , 70 75 80 

Het Phe Het Leu Asp Leu Tyr Asn Ala Met Ala Val Glu Glu Gly Gly 
85 90 95 

Gly Pro Gly Gly Gin Gly Phe Ser Tyr Pro Tyr Lys Ala Val Phe Ser 
100 105 HO 

Thr Gin Gly Pro Pro Leu Ala Ser Leu Gin Asp Ser His Phe Leu Thr 
115 120 125 

Asp Ala Asp Met Val Met Ser Phe Val Asn Leu Val Glu His Asp Lys 
130 135 140 

Glu Phe Phe His Pro Arg Tyr His His Arg Glu Phe Arg Phe Asp Leu 
145 150 155 160 

Ser Lys lie Pro Glu Gly Glu Ala Val Thr Ala Ala Glu Phe Arg lie 
165 170 175 

Tyr Lys Asp Tyr lie Arg Glu Arg Phe Asp Asn Glu Thr Phe Arg He 
180 185 190 

Ser Val Tyr Gin Val Leu Gin Glu His Leu Gly Arg Glu Ser Asp Leu 
195 200 205 
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Phe Leu Leu Asp Ser Arg Thr Leu Trp Ala Ser Glu Glu Gly Trp Leu 
210 215 220 

Val Phe Asp lie Thr Ala Thr Ser Asn His Trp Val Val Asn Fro Arg 
225 230 235 240 

His Asn Leu Gly Leu Gin Leu Ser Val Glu Thr Leu Asp Gly Gin Ser 
245 250 255 

lie Asn Pro Lys Leu Ala Gly Leu He Gly Arg His Gly Pro Gin Asn 
260 265 270 

Lys Gin Pro Phe Met Val Ala Phe Phe Lys Ala Thr Glu Val His Phe 
275 280 285 

Arg Ser He Arg Ser Thr Gly Ser Lys Gin Arg Ser Gin Asn Arg Ser 
290 295 300 

Lys Thr Pro Lys Asn Gin Glu Ala Leu Arg Met Ala Asn Val Ala Glu 
305 310 315 320 

Asn- Ser Ser Ser Asp Gin Arg Gin Ala Cys Lys Lys His Glu Leu Tyr 
325 330 335 

Val Ser Phe Arg Asp Leu Gly Trp Gin Asp Trp lie lie Ala Pro Glu 
340 345 350 

Gly Tyr Ala Ala Tyr Tyr Cys Glu Gly Glu Cys Ala Phe Pro Leu Asn 
355 360 365 

Ser Tyr Het Asn Ala Thr Asn His Ala He Val Gin Thr Leu Val His 
370 375 380 

Phe He Asn Pro Glu Thr Val Pro Lys Pro Cys Cys Ala Pro Thr Gin 
385 390 395 400 

Leu Asn Ala lie Ser Val Leu Tyr Phe Asp Asp Ser Ser Asn Val He 
405 410 415 



Leu Lys Lys Tyr Arg Asn Het Val Val Arg Ala Cys Gly Cys His 
420 425 430 
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(2) INFORMATION FOR SEQ ID N0tl8: 

(i) SEQUENCE CHARACTERISTICS t 

(A) LEKGTH: 1873 base pairs 
(B> TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: HURIDAE 
(F) TISSUE TYPE: EMBRYO 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 104.. 1393 

(D) OTHER INFORMATION: /note- "HOP! (CDNA)" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
CTGCAGCAAG TGACCTCGGG TCGTGGACCG CTGCCCTGCG CCCTCCGCTG CCACCTGGGG 60 

CGGCGCGGGC CCGGTGCCCC GGATCGCGCG TAGAGCCGGC GCG ATG CAC GTG CGC 115 

Met His Val Arg 
1 

TCG CTG CGC GCT GCG GCG CCA CAC AGC TTC GTG GCG CTC TGG GCG CCT 163 
Ser Leu Arg Ala Ala Ala Pro His Ser Phe Val Ala Leu Trp Ala Pro 
5 10 15 20 

CTG TTC TIG CTG CGC TCC GCC CTG GCC GAT TTC AGC CTG GAC AAC GAG 211 
Leu Phe Leu Leu Arg Ser Ala Leu Ala Asp Phe Ser Leu Asp Asn Glu 
25 30 35 

GTG CAC TCC AGC TTC ATC CAC GGG CGC CTC CGC AGC CAG GAG CGG CGG 259 
Val His Ser Ser Phe lie His Arg Arg Leu Arg Ser Glu Glu Arg Arg 
40 45 50 

GAG ATG CAG CGG GAG ATC CTG TCC ATC TTA GGG TTG CCC CAT CGC CCG 307 
Glu Met Gin Arg Glu lie Leu Ser He Leu Gly Leu Pro His Arg Pro 
55 r 60 65 

CGC CCG CAC CTC CAG GGA AAG CAT AAT TCG GCG CCC ATG TTC ATG TTG 355 
Arg Pro His Leu Gin Gly Lys His Asn Ser Ala Pro Met Phe Met Leu 
70 75 80 

GAC CTG TAC AAC GCC ATG GCG GTG GAG GAG AGC GGG CCG GAC GGA CAG 403 
Asp Leu Tyr Asn Ala Het Ala Val Glu Glu Ser Gly Pro Asp Gly Gin 
85 90 95 100 
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GGC TTC TCC TAC CCC TAG AAG GCC GTC TTC AGT ACC GAG GGC CCC CCT 451 
Gly Phe Ser TVr Pro TVr Lys Ala Val Phe Ser Thr Gin Gly Pro Pro 
105 110 115 

TTA GCC AGC CTG CAG GAG AGC CAT TTC CTC ACT GAC GCC GAC ATG GTC 499 
Leu Ala Ser Leu Gin Asp Ser His Phe Leu Thr Asp Ala Asp Met Val 
120 125 130 

ATG AGC TTC GTC AAC CTA GIG GAA CAT GAC AAA GAA TTC TTC CAC CCT 547 
Met Ser Phe Val Asn Leu Val Glu His Asp Lys Glu Phe Phe His Pro 

135 140 145 

CGA TAC CAC CAT CGG GAG TTC CGG TTT GAT CTT TCC AAG ATC CCC GAG 595 
Arg Tyr His His Arg Glu Phe Arg Phe Asp Leu Ser Lys He Pro Glu 
150 155 160 

GGC GAA CGG GTG ACC GCA GCC GAA TTC AGG ATC TAT AAG GAC TAC ATC 643 
Gly Glu Arg Val Thr Ala Ala Glu Phe Arg He Tyr Lys Asp Tyr He 
165 170 175 180 

CGG GAG CGA TTT GAC AAC GAG ACC TTC CAG ATC ACA GTC TAT CAG GTG 691 
Arg Glu Arg Phe Asp Asn Glu Thr Phe Gin He Thr Val TJrr Gin Val 
185 190 195 

CTC CAG GAG CAC TCA GGC AGG GAG TCG GAC CTC TTC TTG CTG GAC AGC 739 
Leu Gin Glu His Ser Gly Arg Glu Ser Asp Leu Phe Leu Leu Asp Ser 
200 205 210 

CGC ACC ATC TGG GCT TCT GAG GAG GGC TGG TTG GTG TTT GAT ATC ACA 787 
Arg Thr He Trp Ala Ser Glu Glu Gly Trp Leu Val Phe Asp He Thr 
215 220 225 

GCC ACC AGC AAC CAC TGG GTG GTC AAC CCT CGG CAC AAC CTG GGC TTA 835 
Ala Thr Ser Asn His Trp Val Val Asn Pro Arg His Asn Leu Gly Leu 
230 235 240 

CAG CTC TCT GTG GAG ACC CTG GAT GGG CAG AGC ATC AAC CCC AAG TTG 883 
Gin Leu Ser Val Glu Thr Leu Asp Gly Gin Ser He Asn Pro Lys Leu 
245 7 250 255 260 

GCA GGC CTG ATT GGA CGG CAT GGA CCC CAG AAC AAG CAA CCC TTC ATG 931 
Ala Gly Leu He Gly Arg His Gly Pro Gin Asn Lys Gin Pro Phe Met 
. 265 270 275 

GTG GCC TTC TTC AAG GCC AGG GAA GTC CAT CTC CGT AGT ATC CGG TCC 979 
Val Ala Phe Phe Lys Ala Thr Glu Val His Leu Arg Ser He Arg Ser 
280 285 290 

ACG GGG GGC AAG CAG CGC AGC CAG AAT CGC TCC AAG ACG CCA AAG AAC 1027 
Thr Gly Gly Lys Gin Arg Ser Gin Asn Arg Ser Lys Thr Pro Lys Asn 
295 300 305 

CAA GAG GCC CTG AGG ATG GCC AGT GTG GCA GAA AAC AGC AGC AGT GAC 1075 
Gin Glu Ala Leu Arg Met Ala Ser Val Ala Glu Asn Ser Ser Ser Asp 
310 315 320 



i 
i 
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CAG AGG CAG GCC TGC AAG AAA CAT GAG CTG XAC GTC AGC TTC OGA GAC 1123 
Gin Arg Gin Ala Cys Lys Lys His Glu Leu Tyr Val Ser She Arg Asp 
325 330 335 340 

CTT GGC TGG CAG GAC TGG ATC ATT GCA CCT GAA GGC TAX GCX GCC TAC 1171 
Leu Gly Trp Gin Asp Trp He lie Ala Pro Glu Gly Tyr Ala Ala Tyr 
345 350 355 

TAC TGT GAG GGA GAG TGC GCC TTC CCT CTG AAC TCC TAC ATG AAC GCC 1219 
Tvr Cys Glu Gly Glu Cys Ala Phe Pro Leu Asn Ser Tyr Met Asn Ala 
3 360 365 370 

ACC AAC CAC GCC ATC GTC CAG ACA CTG GTT CAC TTC ATC AAC CCA GAC 1267 
Thr Asn His Ala lie Val Gin Thr Leu Val His Phe lie Asn Pro Asp 
375 380 385 

ACA GTA CCC AAG CCC TGC TGT GCG CCC ACC CAG CTC AAC GCC ATC TCI 1315 
Thr Val Pro Lys Pro Cys Cys Ala Pro Thr Gin Leu Asn Ala lie Ser 
390 395 400 

GTC CTC TAC TIC GAC GAC AGC TCT AAT GTC ATC CTG AAG AAG TAC AGA 1363 
Val Leu Tyr Phe Asp Asp Ser Ser Asn Val lie Leu Lys Lys Tyr Arg 
405 410 415 420 

AAC ATG GTG GTC CGG GCC TGT GGC TGC CAC TAGCTCITCC TGAGACCCTG 1413 
Asn Met Val Val Arg Ala Cys Gly Cys His 
425 430 

1473 
1533 
1593 
1653 
1713 
1773 
1833 
1873 



ACCTTIGCGG 


GGCCACACCT 


TTCCAAATCT 


TCGATGTCTC 


ACCATCTAAG 


TCTCTCACTG 


CCCACCTTGG 


CGAGGAGAAC 


AGACCAACCT 


CTCCTGAGCC 


TTCCCTCACC 


TCCCAACCGG 


AAGCATGTAA 


GGGTTCCAGA 


AACCTGAGCG 


TGCAGCAGCT 


GATGAGCGCC 


CTTTCCTTCT 


GGCACGTGAC 


GGACAAGATC 


CTACCAGCTA 


CCACAGCAAA 


CGCCTAAGAG 


CAGGAAAAAT 


GTCTGCCAGG 


AAAGTGTCCA 


GTGTCCACAT 


GGCCCCTGGC 


GCTCTGAGTC 


TTTGAGGAGT 


AATCGCAAGC 


CTCGTTCAGC 


TGCAGCAGAA 


GGAAGGGCTT 


AGCCAGGGTG 


GGCGCTGGCG 


TCTGTGTTGA 


AGGGAAACCA 


AGCAGAAGCC 


ACTGTAATGA 


TATGTCACAA 


TAAAACCCAT 


GAATGAAAAA 


AAAAAAAAAA 


AAAAAAAAAA 


AAAAGAATTC 
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20) INFORMATION FOR SEQ ID NOtl9« 

(i) SEQUENCE CHARACTERISTICS > 

(A) LENGTH* 430 amino acids 

(B) TYPE i amino acid 
(0) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(ix) FEATURE: 

(D) OTHER INFORMATION: /product- "mOPl-PP" 

(xi) SEQUENCE DESCRIPTIONS SEQ ID NO: 19: 

Met His Val Arg Ser Leu Arg Ala Ala Ala Pro His Ser Phe Val Ala 
15 10 15 

Leu Trp Ala Pro Leu Phe Leu Leu Arg Ser Ala Leu Ala Asp Phe Ser 
20 25 30 

Leu Asp Asn Glu Val His Ser Ser Phe lie His Arg Arg Leu Arg Ser 
35 40 45 

Gin Glu Arg Arg Glu Met Gin Arg Glu lie Leu Ser lie Leu Gly Leu 
50 55 60 

Pro His Arg Pro Arg Pro His Leu Gin Gly Lys His Asn Ser Ala Pro 
65 70 75 80 

Met Phe Met Leu Asp Leu Tyr Asn Ala Met Ala Val Glu Glu Ser Gly 
85 90 95 

Pro Asp Gly Gin Gly Phe Ser Tyr Pro Tyr Lys Ala Val Phe Ser Thr 
100 105 110 

Gin Gly Pro Pro Leu Ala Ser Leu Gin Asp Ser His Phe Leu Thr Asp 
115 120 125 

Ala Asp Met Val Met Ser Phe Val Asn Leu Val Glu His Asp Lys Glu 
130 135 140 

Phe Phe His Pro Arg Tyr His His Arg Glu Phe Arg Phe Asp Leu Ser 
1A5 150 155 160 

Lys lie Pro Glu Gly Glu Arg Val Thr Ala Ala Glu Phe Arg lie Tyr 
165 170 175 

Lys Asp Tyr lie Arg Glu Arg Phe Asp Asn Glu Thr Phe Gin lie Thr 
180 185 190 

c 

Val Tyr Gin Val Leu Gin Glu His Ser Gly Arg Glu Ser Asp Leu Phe 
195 200 205 
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Leu L u Asp S r Arg Thr H Trp Ala Ser Ghi Glu Gly Trp Leu Val 
210 215 220 

Phe Asp lie Ihr Ala Thr Ser Asn His Trp Val Val Asn Pro Arg His 



225 



230 



240 



Asn Leu Gly Leu Gin Leu Ser Val Glu Thr Leu Asp Gly Gin Ser lie 
245 250 255 

Asn Pro Lys Leu Ala Gly Leu lie Gly Arg His Gly Pro Gin Asn Lys 



260 



265 



Gin Pro Phe Het Val Ala Phe Phe Lys Ala Thr Glu Val His Leu Arg 
275 280 285 

Ser lie Arg Ser Thr Gly Gly Lys Gin Arg Ser Gin Asn Arg Ser Lys 
290 295 300 

Thr Pro Lys Asn Gin Glu Ala Leu Arg Het Ala Ser Val Ala Glu Asn 



305 



310 



315 



Ser Ser Ser Asp Gin Arg Gin Ala Cys Lys Lys His Glu Leu Tyr Val 
325 330 335 

Ser Phe Arg Asp Leu Gly Trp Gin Asp Trp He He Ala Pro Glu Gly 
340 345 350 

Tvr Ala Ala Tyr Tyr Cys Glu Gly Glu Cys Ala Phe Pro Leu Asn Ser 
355 360 365 

Tyr Met Asn Ala Thr Asn His Ala He Val Gin Thr Leu Val His Phe 
370 375 380 

He Asn Pro Asp Thr Val Pro Lys Pro Cys Cys Ala Pro Thr Gin Leu 
385 390 395 400 

Asn Ala He Ser Val Leu Tyr Phe Asp Asp Ser Ser Asn Val He Leu 
405 *10 «15 

Lys Lys Tyr Arg Asn Met Val Val Arg Ala Cys Gly Cys His 



420 



430 
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(2) INFORMATION FOR SEQ ID N0:20: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1723 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) HOLECDLE TYPE: cDNA 

(vi)ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 
(F) TISSUE TYPE: HIPPOCAMPUS 

(ix)FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 490.. 1696 

(D) OTHER INFORMATION: /note- "hOP2 (cDNA)" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 



GGCGCCGGCA 


GAGCAGGAGT 


GGCTGGAGGA GCTGTGGTTG GAGCAGGAGG 


TGGCACGGCA 


60 


GGGCTGGAGG 


GCTCCCTATG 


AGTGGCGGAG ACGGCCCAGG AGGCGCTGGA 


GCAACAGCTC 


120 


CCACACCGCA 


CCAAGCGGTG 


GCTGCAGGAG CTCGCCCATC GCCCCTGCGC 


TGCTCGGACC 


180 


GCGGCCACAG 


CCGGACTGGC 


GGGTACGGCG GCGACAGAGG CATTGGCCGA 


GAGTCCCAGT 


240 


CCGCAGAGTA 


GCCCCGGCCT 


CGAGGCGGTG GCGTCCCGGT CCTCTCCGTC 


CAGGAGCCAG 


300 


GACAGGTGTC 


GCGCGGCGGG 


GCTCCAGGGA CCGCGCCTGA GGCCGGCTGC 


CCGCCCGTCC 


360 


CGCCCCGCCC 


CGCCGCCCGC 


CGCCCGCCGA GCCCAGCCTC CTTGCCGTCG 


GGGCGTCCCC 


420 


AGGCCCTGGG 


TCGGCCGCGG 

i 


AGCCGATGCG CGCCCGCTGA GCGCCCCAGC 


TGAGCGCCCC 


480 


CGGCCTGCC ATG ACC GCG 
Met Thr Ala 


CTC CCC GGC CCG CTC TGG CTC CTG GGC CTG 
Leu Pro Gly Pro Leu Trp Leu Leu Gly Leu 


528 



I 5 10 

GCG CTA TGC GCG CTG GGC GGG GGC GGC CCC GGC CTG CGA CCC CCG CCC 576 
Ala Leu Cys Ala Leu Gly Gly Gly Gly Pro Gly Leu Arg Pro Pro Pro 
15 20 25 

GGC TGT CCC CAG CGA CGT CTG GGC GCG CGC GAG CGC CGG GAC GIG CAG 624 
Gly Cys Fro Gin Arg Arg Leu Gly Ala Arg Glu Arg Arg Asp Val Gin 
30 35 40 45 

CGC GAG ATC CTG GCG GTG CTC GGG CTG CCT GGG CGG CCC CGG CCC CGC 672 
Arg Glu lie Leu Ala Val Leu Gly Leu Pro Gly Arg Pro Arg Pro Arg 
50 55 60 
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GCG CCA. CCC GCC GCC TCC CGG CTG CCC GCG TCC GCG CCG CXC TTC ATG 
Ala Pro Pro Ala Ala Ser Arg Leu Pro Ala Ser Ala Pro Leu Phe Het 
65 70 75 

CTG GAC CTG TAG CAC GCC ATG GCC GGC GAC GAC GAC GAG GAC GGC GCG 
Lea Asp Leu Tyr His Ala net Ala Gly Asp Asp Asp Glu Asp Gly Ala 
80 85 90 

CCC GCG GAG CGG CGC CTG GGC CGC GCC GAC CTG GTC ATG AGC TTC GTT 
Pro Ala Glu Arg Arg Leu Gly Arg Ala Asp Leu Val Het Ser Phe Val 
95 100 105 

AAC ATG GIG GAG CGA GAC CGT GCC CTG GGC CAC CAG GAG CCC CAT TGG 
Asn Het Val Glu Arg Asp Arg Ala Leu Gly His Gin Glu Pro His Trp 
110 115 120 125 

AAG GAG TTC CGC TTl GAC CTG ACC CAG AIC CCG GCT GGG GAG GCG GTC 
Lvs Glu Phe Arg Phe Asp Leu Thr Gin lie Pro Ala Gly Glu Ala Val 
7 130 135 140 

ACA GCT GCG GAG TTC CGG ATI TAC AAG GIG CCC AGC AIC CAC CTG CTC 
Thr Ala Ala Glu Phe Arg He Tyr Lys Val Pro Ser lie His Leu Leu 
145 150 155 

AAC AGG ACC CTC CAC GTC AGC ATG TTC CAG GIG GTC CAG GAG CAG TCC 
Asn Arg Thr Leu His Val Ser Het Phe Gin Val Val Gin Glu Gin Ser 
160 165 170 

AAC AGG GAG' TCT GAC TTG TTC TIT TIG GAT CTT CAG ACG CTC CGA GCT 
Asn Arg Glu Ser Asp Leu Phe Phe Leu Asp Leu Gin Thr Leu Arg Ala 
175 180 185 

GGA GAC GAG GGC TGG CTG GTG CTG GAT GTC ACA GCA GCC ACT GAC TGC 
Gly Asp Glu Gly Trp Leu Val Leu Asp Val Thr Ala Ala Ser Asp Cys 
190 195 200 205 

TGG TTG CTG AAG CGT CAC AAG GAC CTG GGA CTC CGC CTC TAT GIG GAG 
Trp Leu Leu Lys Arg His' Lys Asp Leu Gly Lett Arg Leu Tyr Val Glu 
210 215 220 

ACT GAG GAC GGG CAC AGC GTG GAT CCT GGC CTG GCC GGC CTG CTG GGT 
Thr Glu Asp Gly His Ser Val Asp Pro Gly Leu Ala Gly Leu Leu Gly 
225 230 235 

CAA CGG GCC CCA CGC TCC CAA CAG CCT TTC GTG GTC ACT TTC TTC AGG 
Gin Arg Ala Pro Arg Ser Gin Gin Pro Phe Val Val Thr Phe Phe Arg 
240 245 250 

GCC AGT CCG ACT CCC AIC CGC ACC CCT CGG GCA CTG AGG CCA CTG AGG 
Ala Ser Pro Ser Pro lie Arg Thr Pro Arg Ala Val Arg Pro Leu Arg 
255 260 265 



720 



768 



816 



864 



912 



960 



1008 



1056 



1104 



1152 



1200 



1248 



1296 
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AG6 AGG CAG CCG AAG AAA AGC AAC GAG CTG CCG CAG GCC AAC CGA CTC 1344 
Arg Arg Gin Pro Lys Lys Ser Asn Glu Leu Pro Gin Ala Asn Arg Leu 
270 275 280 285 

CCA GGG ATC ITT GAT GAC GIC CAC GGC ICC CAC GGC CGG CAG GTC TGC 1392 
Pro Gly lie Phe Asp Asp Val Bis Gly Ser His Gly Arg Gin Val Cys 
290 295 300 

CGT CGG CAC GAG CTC TAC GIC AGC TIC CAG GAC CTC GGC TGG CTG GAC 1440 
Arg Arg His Glu Leu Tyr Val Ser Phe Gin Asp Leu Gly Trp Leu Asp 
305 310 315 

TGG GTC ATC GCT CCC CAA GGC TAC ICG GCC TAT TAC TGT GAG GGG GAG 1488 
Trp Val lie Ala Pro Gin Gly Tyr Ser Ala Tyr Tyr Cys Glu Gly Glu 
320 325 330 

TGC ICC TTC CCA CTG GAC TCC TGC ATG AAT GCC ACC AAC CAC GCC ATC 1536 
Cys Ser Phe Pro Leu Asp Ser Cys Het Asn Ala Thr Asn His Ala lie 
335 340 345 

CTG CAG TCC CTG GTG CAC CTG ATG AAG CCA AAC GCA GTC CCC AAG GCG 1584 
Leu Gin Ser Leu Val His Leu Het Lys Pro Asn Ala Val Pro Lys Ala 
350 355 360 365 

TGC TGT GCA CCC ACC AAG CTG AGC GCC ACC TCI GTG CTC TAC TAT GAC 1632 
Cys Cys Ala Pro Thr Lys Leu Ser Ala Thr Ser Val Leu Tyr Tyr Asp 
370 375 380 

AGC AGC AAC AAC GTC ATC CTG CGC AAA CAC CGC AAC ATG GTG GTC AAG 1680 
Ser Ser Asn Asn Val lie Leu Arg Lys His Arg Asn Het Val Val Lys 
385 390 395 

GCC TGC GGC TGC CAC T GAGTCAGCCC GCCCAGCCCT ACTGCAG 1723 
Ala Cys Gly Cys His 
400 
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(2) INFORMATION FOR SEQ ID N0i21: 

(i) SEQUENCE CHARACTERISTICS t 

(A) LENGTH: 402 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TIPE: protein 

(ix) FEATURE: 

(A) OTHER INFORMATION; /product- n hOP2-PP" 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:21: 

Met Thr Ala Leu Pro Gly Pro Leu Trp Leu Leu Gly Leu Ala Leu Cys 
1 5 10 15 

Ala Leu Gly Gly Gly Gly Pro Gly Leu Arg Pro Pro Pro Gly Cys Pro 
20 25 30 

Gin Arg Arg Leu Gly Ala Arg Glu Arg Arg Asp Val Gin Arg Glu He 
35 40 45 

Leu Ala Val Leu Gly Leu Pro Gly Arg Pro Arg Pro Arg Ala Pro Pro 
50 55 60 

Ala Ala Ser Arg Leu Pro Ala Ser Ala Pro Leu Phe Met Leu Asp Leu 
65 70 75 80 

Tyx His Ala Met Ala Gly Asp Asp Asp Glu Asp Gly Ala Pro Ala Glu 
85 90 95 

Ars Are Leu Gly Arg Ala Asp Leu Val Met Ser Phe Val Asn Met Val 
* * 100 105 HO 

Glu Are Aso Are Ala Leu Gly His Gin Glu Pro His Trp Lys Glu Phe 
115 120 125 

Arg Phe Asp Leu Thr Gin lie Pro Ala Gly Glu Ala Val Ihr Ala Ala 
130 135 140 

Glu Phe Arg lie Tyr Lys Val Pro Ser He His Leu Leu Asn Arg Thr 
145 150 155 160 

Leu His Val Ser Met Phe Gin Val Val Gin Glu Gin Ser Asn Arg Glu 
1&5 170 175 

Ser Asp Leu Phe Phe Leu Asp Leu Gin Thr Leu Arg Ala Gly Asp Glu 
180 185 190 

Gly Trp Leu Val Leu Asp Val Thr Ala Ala Ser Asp Cys Trp Leu Leu 
195 200 205 
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Lys Arg His Lys Asp Leu Gly Leu Arg Leu Tyr Val Glu Thr Glu Asp 
210 215 220 

Gly Bis Sex Val Asp Pro Gly Leu Ala Gly Leu Leu Gly Gin Arg Ala 
225 230 235 240 

Pro Arg Ser Gin Gin Pro Phe Val Val Thr Pbe Pbe Arg Ala Ser Pro 
245 250 255 

Ser Pro He Arg Thr Pro Arg Ala Val Arg Pro Leu Arg Arg Arg Gin 
260 265 270 

Pro Lys Lys Ser Asn Glu Leu Pro Gin Ala Asn Arg Leu Pro Gly lie 
275 280 285 

Phe Asp Asp Val His Gly Ser His Gly Arg Gin Val Cys Arg Arg His 
290 295 300 

Glu Leu Tyr Val Ser Phe Gin Asp Leu Gly Trp Leu Asp Trp Val He 
305 310 315 320 

Ala Pro Gin Gly Tyr Ser Ala Tyr Tyr Cys Glu Gly Glu Cys Ser Phe 
325 330 335 

Pro Leu Asp Ser Cys Met Asn Ala Thr Asn His Ala He Leu Gin Ser 
340 345 350 

Leu Val His Leu Het Lys Pro Asn Ala Val Pro Lys Ala Cys Cys Ala 
355 , 360 365 

* 

Pro Thr Lys Leu Ser Ala Thr Ser Val Leu Tyr Tyr Asp Set Ser Asn 
370 375 380 

Asn Val lie Leu Arg Lys His Arg Asn Met Val Val Lys Ala Cys Gly 
385 390 395 400 

Cys His 



WO 93/05172 



100 



PCT/US92/07359 



(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1926 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D TOPOLOGY* linear 

(ii) MOLECULE TYPE: cDKA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISKi MURIDAE 
(F) TISSUE TYPE: EMBRYO 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 93.. 1289 

(D) OTHER INFORMATION: /note- "«0P2 cDNA" 

(xi) SEQUENCE DESCRIPTION i SEQ ID N0:22: 

GCCAGGCACA GGTGCGCCGT CTGGTCCTCC CCGTCTGGCG TCAGCCGAGC 50 

CCGACCAGCT ACCAGTGGAT GCGCGCCGGC TGAAAGICCG AG ATG GCT ATG CGI 104 

Met Ala Met Axg 
1 

CCC GGG CCA CTC IGG CTA ITG GGC CTT GCT CTG TGC GCG CTG GGA GGC 152 
Pro Gly Pro- Leu Trp Leu Leu Gly Leu Ala Leu Cys Ala Leu Gly Gly 
5 10 15 20 

GGC CAC GGT CCG CGT CCC CCG CAC ACC TGT CCC CAG CGT CGC CTG GGA 
Gly His Gly Pro Arg Pro Pro His Thr Cys Pro Gin Arg Arg Leu Gly 
25 30 35 



200 



GCG CGC GAG CGC CGC GAC ATG CAG CGT GAA ATC CTG GCG GTG CTC GGG 248 
Ala Arg Gin Arg Arg Asp Met Gin Arg Glu lie Leu Ala Val Leu Gly 
40 45 50 

CTA CCG GGA CGG CCC CGA CCC CGT GCA CAA CCC GCG GCT GCC CGG CAG 296 
Leu Pro Gly Arg Pro Arg Pro Arg Ala Gin Pro Ala Ala Ala Arg Gin 
55 60 65 

CCA GCG TCC GCG CCC CTC TTC ATG TIG GAC CTA TAC CAC GCC ATG ACC 344 
Pro Ala Ser Ala Pro Leu Phe Met Leu Asp Leu Tyr His Ala Met Thr 
70 75 80 

GAT GAC GAC GAC GGC GGG CCA CCA CAG GCT CAC TTA GGC CGT GCC GAC 392 
Asp Asp Asp Asp Gly Gly Pro Pro Gin Ala His Leu Gly Arg Ala Asp 
85 90 95 100 
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CTG GTC ATG AGC TTC GTC AAC ATG GTG GAA CGC GAC CGT ACC CTG GGC 440 
Leu Val Met Ser Phe Val Asn Het Val Glu Arg Asp Arg Thr Leu Gly 
105 110 115 

TAC CAG GAG CCA CAC TGG AAG GAA TTC CAC TIT GAC CIA ACC CAG ATC 488 
Tyr Gin Glu Pro His Trp Lys Glu Phe His Phe Asp Leu Thr Gin He 
120 125 130 

CCT GCT GGG GAG GCT GTC ACA GCT GCT GAG TTC CGG ATC TAC AAA GAA 536 
Pro Ala Gly Glu Ala Val Thr Ala Ala Glu Phe Arg He Tyr Lys Glu 
135 140 145 

CCC AGC ACC CAC CCG CTC AAC ACA ACC CTC CAC ATC AGC ATG TTC GAA 584 
Pro Ser Thr His Pro Leu Asn Thr Thr Leu His lie Ser Het Phe Glu 
150 155 160 

GTG GTC CAA GAG CAC ICC AAC AGG GAG TCT GAC TIG TTC TIT TTG GAT 632 
Val Val Gin Glu His Ser Asn Arg Glu Ser Asp Leu Phe Phe Leu Asp 
165 170 175 180 

CTT CAG ACG CTC CGA TCT GGG GAC GAG GGC TGG CTG GTG CTG GAC ATC 680 
Leu Gin Thr Leu Arg Ser Gly Asp Glu Gly Trp Leu Val Leu Asp He 
185 190 195 

ACA GCA GCC AGT GAC CGA TGG CTG CTG AAC CAT CAC AAG GAC CTG GGA 728 
Thr Ala Ala Ser Asp Arg Trp Leu Leu Asn His His Lys Asp Leu Gly 
200 205 210 

CTC CGC CTC TAT GTG GAA ACC GCG GAT GGG CAC AGC ATG GAT CCT GGC 776 
Leu Arg Leu Tyr Val Glu Thr Ala Asp Gly His Ser Het Asp. Pro Gly 
215 220 225 

CTG GCT GGT CTG CTT GGA CGA CAA GCA CCA CGC TCC AGA CAG CCT TTC 824 
Leu Ala Gly Leu Leu Gly Arg Gin Ala Pro Arg Ser Arg Gin Pro Phe 
230 235 240 

ATG GTA ACC TTC TTC AGG GCC AGC CAG AGT CCT GTG CGG GCC CCT CGG 872 
Het Val Thr Phe Phe Arg Ala Ser Gin Ser Pro Val Arg Ala Pro Arg 
245 250 255 260 

GCA GCG AGA CCA CTG AAG AGG AGG CAG CCA AAG AAA ACG AAC GAG CTT 920 
Ala Ala Arg Pro Leu Lys Arg Arg Gin Pro Lys Lys Thr Asn Glu Leu 
( 265 270 275 

CCG CAC CCC AAC AAA CTC CCA GGG ATC ITT GAT GAT GGC CAC GGT TCC 968 
Pro His Pro Asn Lys Leu Pro Gly He Phe Asp Asp Gly His Gly Ser 
280 285 290 

CGC GGC AGA GAG GTT TGC CGC AGG CAT GAG CTC TAC GTC AGC TTC CGT 1016 
Arg Gly Arg Glu Val Cys Arg Arg His Glu Leu Tyr Val Ser Phe Arg 
295 300 305 
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GAC CTT GGC TGG CTG GAC TGG 
Asp Leu Gly Trp Leu Asp Trp 
310 315 

TAT IAC TGI GAG GGG GAG TGT 
Tyr Tyr Cys Glu Gly Glu Cys 
325 330 

GCC ACC AAC CAT. GCC AIC TIG 
Ala Thr Asn His Ala lie Leu 
345 

GAT GTT GTC CCC AAG GCA TGC 
Asp Val Val Pro Lys Ala Cys 
360 

TCT GTG CTG TAC TAT GAC AGC 
Ser Val Leu Tyr Tyr Asp Set 
375 

CGT AAC ATG GTG GTC AAG GCC 
Arg Asn Met Val Val Lys Ala 
390 395 

TGCTTCTACT ACCTTACCAT CTGGCCGGGC CCCTCTCCAG AGGCAGAAAC CCTICTATGT 1369 

TATCATAGCT CAGACAGGGG CAATGGGAGG CCCTTCACTT CCCCTGGCCA CTTCCTGCTA 1429 

AAATTCTGGT CTTTCCCAGT TCCTCTGTCC TTCATGGGGT TTCGGGGCTA TCACCCCGCC 1489 

CTCTCCATCC TCCTACCCCA AGCATAGACT GAATGCACAC AGCATCCCAG AGCTATGCTA 1549 

ACTGAGAGGT CTGGGGTCAG CACTGAAGGC CCACATGAGG AAGACTGATC CTTGGCCATC 1609 

CTCAGCCCAC AATGGCAAAT TCTGGATGGT CTAAGAAGGC CGTGGAATTC TAAACTAGAT 1669 

GATCTGGGCT CTCTGCACCA TTCATTGTGG CAGTTGGGAC ATTTTTAGGT ATAACAGACA 1729 

CATACACTTA GAICAATGCA TCGCTGTACT CCTTGAAATC AGAGCTAGCT TGTTAGAAAA 1789 

AGAATCAGAG CCAGGTATAG CGGTGCATGT CATTAATCCC AGCGCTAAAG AGACAGAGAC 1849 

AGGAGAATCT CTGTGAGTTC AAGGCCACAT AGAAAGAGCC TGTCTCGGGA GCAGGAAAAA 1909 

AAAAAAAAAC GGAATTC 1926 



GTC ATC GCC CCC CAG GGC IAC TCT GCC 1064 
Val lie Ala Pro Gin Gly Tyr Ser Ala 
320 

GCT TTC CCA CTG GAC TCC TGT ATG AAC 1112 
Ala Phe Pro Leu Asp Ser Cys Met Asn 
335 340 

CAG TCT CTG GTG CAC CTG ATG AAG CCA 1160 
Gin Ser Leu Val His Leu Met Lys Pro 
350 355 

TGT GCA CCC ACC AAA CTG AGI GCC ACC 1208 
Cys Ala Pro Thr Lys Leu Ser Ala Thr 
365 370 

AGC AAC AAT GTC ATC CTG CGT AAA CAC 1256 
Ser Asn Asn Val lie Leu Arg Lys His 
380 385 

TGT GGC TGC CAC TGAGGCCCCG CCCAGCATCC 1309 
Cys Gly Cys His 
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(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 399 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(ix) FEATURE: 

(D) OTHER INFORMATION: /product- "ttOF2-F?" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

Het Ala Met Arg Pro Gly Pro Leu Trp Leu Leu Gly Leu Ala Leu Cys 
1 5 10 15 

Ala Leu Gly Gly Gly His Gly Pro Arg Pro Pro His Thr Cys Pro Gin 
20 25 30 

Arg Arg Leu Gly Ala Arg Glu Arg Arg Asp Het Gin Arg Glu He Leu Ala 
35 40 45 

Val Leu Gly Leu Pro Gly Arg Pro Arg Pro Arg Ala Gin Pro Ala Ala 
50 55 60 65 

Ala Arg Gin Pro Ala Ser Ala Pro Leu Pbe Het Leu Asp Leu Tyr His Ala 
70 75 80 

Het Thr Asp Asp Asp Asp Gly Gly Pro Pro Gin Ala His Leu Gly Arg 
85 90 95 

Ala Asp Leu Val Het Ser Phe Val Asn Het Val Glu Arg Asp Arg Thr 
100 105 110 

Leu Gly Tyr Gin Glu Pro His Trp Lys Glu Phe His Phe Asp Leu Thr 
115 120 125 130 

Gin He Pro Ala Gly Glu Ala Val Thr Ala Ala Glu Phe Arg lie Tyr 
135 140 145 

Lys Glu Pro Ser Thr His Pro Leu Asn Thr Thr Leu His He Ser Het 
150 155 160 

Phe Glu Val Val Gin Glu His Ser Asn Arg Glu Ser Asp Leu Phe Phe 
165 170 175 

Leu Asp Leu Gin Thr Leu Arg Ser Gly Asp Glu Gly Trp Leu Val Leu 
180 185 190 

Asp He Thr Ala Ala Ser Asp Arg Trp Leu Leu Asn His His Lys Asp 
195 200 205 210 
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Lea Gly Leu Are Leu Tyr Val Gin Thr Ala Asp Gly His Ser Met Asp 
215 220 225 

Pro Gly Leu Ala Gly Leu Leu Gly Arg Gin Ala Pro Arg Ser Arg Gin 
230 235 240 

Pro Phe Met Val Thr Phe Phe Arg Ala Ser Gin Ser Pro Val Arg Ala 
245 250 255 

Pro Are Ala Ala Arg Pro Leu Lys Arg Arg Gin Pro Lys Lys Thr Asn 
260 265 270 

Glu Leu Pro His Pro Asn Lys Leu Pro Gly He Phe Asp Asp Gly His 
275 280 285 290 

Gly Ser Arg Gly Arg Glu Val Cys Arg Arg His Glu Leu Tyr Val Ser 
295 300 305 

Phe Are Asp Leu Gly Trp Leu Asp Trp Val lie Ala Pro Gin Gly Tyr 
310 315 320 

Ser Ala Tyr Tyr Cys Glu Gly Glu Cys Ala Phe Pro Leu Asp Ser Cys 
325 330 335 

Met Asn Ala Thr Asn His Ala lie Leu Gin Ser Leu Val His Leu Met 
340 345 350 

Lys Pro Asp Val Val Pro Lys Ala Cys Cys Ala Pro Thr Lys Leu Ser 
355 360 365 370 

Ala Thr Ser Val Leu Tyr Tyr Asp Ser Ser Asn Asn Val lie Leu Arg 
375 380 385 

Lys His Arg Asn Met Val Val Lys Ala Cys Gly Cys His 
390 395 
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(2) INFORMATION FOR SEQ ID NO:24: 

(i) SEQUENCE CHARACTERISTICS t 

(A) LENGTH i 1368 base pairs 

(B) TYPBs nucleic acid 

(C) STRAND EDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE; cDNA 



(iz) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..1368 

(D) OTHER INFORMATION : /STANDARD NAHE» n 60A" 

(z) PUBLICATION INFORMATION: 

(A) AUTHORS: WHARTON, KRISTI A.; THOMSEN, GERALD H.; 
GELBERT, WILLIAM M. 

(B) TITLE: DROSOPHIU 60A GENE... 

(C) JOURNAL: PROC. NAT'L ACAD. SCI. USA 

(D) VOLUME: 88 

(E) RELEVANT RESIDUES IN SEQ ID NO: 3: FROM 1 TO 1368 

(F) PAGES: 9214-9218 

(G) DATE: OCT - 1991 

(zi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

ATG TCG GGA CTG CGA AAC ACC TCG GAG GCC GTT GCA GTG CTC GCC TCC 48 
Met Ser Gly Leu Arg Asn Thr Ser Glu Ala Val Ala Val Leu Ala Ser 
1 5 10 15 

CTG GGA CTC GGA ATG GTT CTG CTC ATG TTC GTG GCG ACC ACG CCG CCG 96 
Leu Gly Leu Gly Met Val Leu Leu Met Phe Val Ala Thr Thr Pro Pro 
20 25 30 

GCC GTT GAG GCC ACC CAG TCG GGG ATT TAC ATA GAC AAC GGC AAG GAC 144 
Ala Val Glu Ala Thr Gin Ser Gly lie Tyr He Asp Asn Gly Lys Asp 
35 40 45 

CAG ACG ATC ATG CAC AGA GTG CTG AGC GAG GAC GAC AAG CTG GAC GTC 192 
Gin Thr He Met His Arg Val Leu Ser Glu Asp Asp Lys Leu Asp Val 
50 55 60 

TCG TAC GAG ATC CTC GAG TTC CTG GGC ATC GCC GAA CGG CCG ACG CAC 240 
Ser Tyr Glu He Leu Glu Phe Leu Gly He Ala Glu Arg Pro Thr His 
65 70 75 80 

CTG AGC AGC CAC CAG TIG TCG CTG AGG AAG TCG GCT CCC AAG TTC CTG 288 
Leu Ser Ser His Gin Leu Ser Leu Arg Lys Ser Ala Pro Lys Phe Leu 
85 90 95 
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CTG GAC GTC IAC CAC CGC ATC ACG GCG GAG GAG GGT CTC AGC GAT CAG 336 
Lett Asd Val Tyr His Are lie Thr Ala Glu Glu Gly Leu Ser Asp Gin 
100 105 110 

GAT GAG GAC GAC GAC XAC GAA CGC GGC CAT CGG TCC AGG AGG AGC GCC 384 
Asp Glu Asp Asp Asp Tyr Glu Arg Gly. His Arg Ser Arg Arg Sex Ala 
115 120 125 

GAC CTC GAG GAG, GAX GAG GGC GAG GAG CAG AAG AAC TTC ATC ACC GAC 432 
Asp Leu Glu Glu' Asp Glu Gly Glu Gin Gin Lys Asn Phe lie Thr Asp 
130 135 140 

CTG GAC AAG CGG GCC ATC GAC GAG AGC GAC ATC ATC ATG ACC TTC CTG 480 
Leu Asp Lys Arg Ala lie Asp Glu Ser Asp lie lie Het Thr Phe Leu 
145 150 155 160 

AAC AAG CGC CAC CAC AAT GIG GAC GAA CTG CGT CAC GAG CAC GGC CGT 528 
Asn Lys Arg His His Asn Val Asp Glu Leu Arg His Glu His Gly Arg 
165 170 175 

CGC CTG TGG TTC GAC GTC TCC AAC GIG CCC AAC GAC AAC TAC CTG GIG 576 
Arg Leu Trp Phe Asp Val Ser Asn Val Pro Asn Asp Asn Tyr Leu Val 
180 185 190 

ATG GCC GAG CTG CGC ATC TAT CAG AAC GCC AAC GAG GGC AAG TGG CTG 624 
Het Ala Glu Leu Arg He Tyr Gin Asn Ala Asn Glu Gly Lys Trp Leu 
195 200 205 

ACC GCC AAC AGG GAG TTC ACC ATC ACG GTA TAC GCC ATT GGC ACC GGC 672 
Thr Ala Asn Arg Glu Phe Thr lie Thr Val Tyr Ala lie Gly Thr Gly 
210 215 220 

ACG CTG GGC CAG CAC ACC ATG GAG COG CTG TCC TCG GTG AAC ACC ACC 720 
Tht Leu Gly Gin His Thr Het Glu Pro Leu Ser Ser Val Asn Thr Thr 
225 230 235 240 

GGG GAC TAC GTG GGC TGG TTG GAG CTC AAC GTG ACC GAG GGC CTG CAC 768 
Gly Asp Tyr Val Gly Trp* Leu Glu Leu Asn Val Thr Glu Gly Leu His 
245 250 255 

GAG TGG CTG GTC AAG TCG AAG GAC AAT CAT GGC ATC TAC ATT GGA GCA 816 
Glu Trp Leu Val Lys Ser Lys Asp Asn His Gly lie Tyr lie Gly Ala 
260 265 270 

CAC GCT GTC AAC CGA CCC GAC CGC GAG GTG AAG CTG GAC GAC ATT GGA 864 
His Ala Val Asn Arg Pro Asp Arg Glu Val Lys Leu Asp Asp lie Gly 
275 280 285 

CTG ATC CAC CGC AAG GTG GAC GAC GAG TTC CAG CCC TTC ATG ATC GGC 912 
Leu lie His Arg Lys Val Asp Asp Glu Phe Gin Pro Phe Het lie Gly 
290 295 300 



WO 93/05172 



107 



PCT/US92/07359 



TTC TTC CGC GGA CCG GAG CTG ATC AAG GCG ACG GCC CAC AGC AGC CAC 960 
Phe The Arg Gly Pro Glu Leu lie Lys Ala Thr Ala His Ser Ser His 
305 310 315 320 

CAC AGG AGC AAG CGA AGC GCC AGC CAT CCA CGC AAG CGC AAG AAG TCG 1008 
His Arg Ser Lys Arg Ser Ala Ser His Pro Arg Lys Arg Lys Lys Ser 
325 330 335 

GIG TCG CCC AAC AAC GTG CCG CTG CTG GAA CCG ATG GAG AGC ACG CGC 1056 
Val Ser Pro Asa Asn Val Pro Leu Leu Glu Pro Met Glu Ser Thr Arg 
340 345 350 

AGC TGC CAG ATG CAG ACC CTG TAC ATA GAC TTC AAG GAT CTG GGC TGG 1104 
Ser Cys Gin Het Gin Thr Leu Tyr He Asp Phe Lys Asp Leu Gly Trp 
355 360 365 

CAT GAC TGG ATC ATC GCA CCA GAG GGC TAT GGC GCC TTC TAC TGC AGC 1152 
His Asp Trp He He Ala Pro Glu Gly Tyr Gly Ala Phe Tyr Cys Ser 
370 375 380 

GGC GAG TGC AAT TTC CCG CTC AAT GCG CAC ATG AAC GCC ACG AAC CAT 1200 
Gly. Glu Cys Asn Phe Pro Leu Asn Ala His Het Asn Ala Thr Asn His 
385 390 395 400 

GCG ATC GTC CAG ACC CTG GTC CAC CTG CTG GAG CCC AAG AAG GTG CCC 1248 
Ala lie Val Gin Thr Leu Val His Leu Leu Glu Pro Lys Lys Val Pro 
405 410 415 

AAG CCC TGC TGC GCT CCG ACC AGG CTG GGA GCA CTA CCC GTT CTG TAC 1296 
Lys Pro Cys Cys Ala Pro Thr Arg Leu Gly Ala Leu Pro Val Leu Tyr 
420 425 430 

CAC CTG AAC GAC GAG AAT GTG AAC CTG AAA AAG TAT AGA AAC ATG ATT 1344 
His Leu Asn Asp Glu Asn Val Asn Leu Lys Lys Tyr Arg Asn Het He 
435 440 445 

GTG AAA TCC TGC GGG TGC CAT TGA 1368 
Val Lys Ser Cys Gly Cys His 
450 455 



WO 93/05172 



108 



PCT/US92/07359 



(2) INFORMATION FOE SEQ ID N0i25< 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTHs 455 amino acids 

(B) TYPE: amino add 
(D) TOPOLOGY t linear 

(ii) MOLECULE TYPE: protein 

(3d.) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

Het Ser Gly Leu Arg Asn Thr Ser Glu Ala Val Ala Val Leu Ala Ser 
I 5 10 15 

Leu Gly Leu Gly .Met Val Leu Leu Met Phe Val Ala Thr Thr Pro Pro 
20 25 30 

Ala Val Glu Ala Thr Gin Ser Gly He Tyr lie Asp Asn Gly Lys Asp 
35 40 *5 

Gin Thr lie Het His Arg Val Leu Ser Glu Asp Asp Lys Leu Asp Val 
50 55 60 

Ser Tyr Glu lie Leu Glu Phe Leu Gly He Ala Glu Arg Pro Thr His 
65 70 75 80 

Leu Ser Ser His Gin Leu Ser Leu Arg Lys Ser Ala Pro Lys Phe Leu 
85 90 95 

Leu Asp Val Tyr His Arg lie Thr Ala Glu Glu Gly Leu Ser Asp Gin 
100 105 110 

Asp Glu Asp Asp Asp Tyr Gltt Arg Gly His Arg Ser Arg Arg Ser Ala 
115 120 125 

Asp Leu Glu Glu Asp Glu Gly Glu Gin Gin Lys Asn Phe lie Thr Asp 
130 135 140 

Leu Asp Lys Arg Ala lie Asp Glu Ser Asp lie lie Het Thr Phe Leu 
145 150 155 160 

Asn Lys Arg His His Asn Val Asp Glu Leu Arg His Glu His Gly Arg 
165 170 175 

Arc Leu Trp Phe Asp Val Ser Asn Val Pro Asn Asp Asn Tyr Leu Val 
180 185 190 

Het Ala Glu Leu Arg lie Tyr Gin Asn Ala Asn Glu Gly Lys Trp Leu 
195 200 205 

Thr Ala Asn Arg Glu Phe Thr lie Thr Val Tyr Ala lie Gly Thr Gly 
210 215 220 
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Thr Leu Gly Gin His Thr If t Glu Pro Leu Ser Ser Val Asn Thr Tbr 
225 230 235 240 

Gly Asp Tyr Val Gly Trp Leu Glu Leu Asn Val Thr Glu Gly Leu His 
245 250 255 

Glu Trp Leu Val Lys Ser Lys Asp Asn His Gly lie lyr He Gly Ala 
260 265 270 

His Ala Val Asn Arg Pro Asp Arg Glu Val Lys Leu Asp Asp lie Gly 
275 280 285 

Leu He His Arg Lys Val Asp Asp Glu Phe Gin Pro Phe Het lie Gly 
290 295 300 

Phe Phe Arg Gly Pro Glu Leu He Lys Ala Thr Ala His Ser Ser His 
305 310 315 320 

His Arg Ser Lys Arg Ser Ala Ser His Pro Arg Lys Arg Lys Lys Ser 
325 330 335 

Val Ser Pro Asn Asn Val Pro Leu Leu Glu Pro Met Glu Ser Thr Arg 
340 345 350 

Ser Cys Gin Met Gin Thr Leu Tyr He Asp Phe Lys Asp Leu Gly Trp 
355 360 365 

His Asp Trp He He Ala Pro Glu Gly Tyr Gly Ala Phe Tyr Cys Ser 
370 375 380 

Gly Glu Cys Asn Phe Pro Leu Asn Ala His Met Asn Ala Thr Asn His 
385 390 395 400 

Ala He Val Gin Thr Leu Val His Leu Leu Glu Pro Lys Lys Val Pro 
405 410 415 

Lys Pro Cys Cys Ala Pro Thr Arg Leu Gly Ala Leu Pro Val Leu Tyr 
420 425 430 

His Leu Asn Asp Glu Asn Val Asn Leu Lys Lys Tyr Arg Asn Het lie 
435 440 445 



Val Lys Ser Cys Gly Cys His 
450 455 



WO 93/05172 



110 



PCTAJS92/07359 



(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: amino acids 

(B) TYPEr amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 

(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1..102 

(D) OTHER INFORMATION: /note=*EttB3 n 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:26: 

(i) SEQUENCE CHARACTERISTICS t 

(A) LENGTH: 104 amino acids 

(B) TIPS: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1.-104 

(D) OTHER INFORMATION: /note="BHP3" 

(xi)SEQUENCE DESCRIPTION: SEQ ID NO:26: 

Qys Ala Arg Arg Tyr Leu Lys Val Asp Phe Ala Asp He Gly Trp Ser 

Glu Trp lie He Ser Pro Lys Ser Phe Asp Ala iyr Try Cys Ser Gly 
20 25 30 

Ala CVs Gin Phe Pro Met Pro Lys Ser Leu Lys Pro Ser Asn His Ala 
35 40 45 

Ihr He Gin Ser He Val Ala Arg Ala Val Gly Val Val Pro Gly He 
50 55 60 

Pro Glu Pro Cys CVs Val Pro Glu Lys Met Ser Ser Leu Ser He Leu 
65 70 75 80 

Phe Phe Asp Glu Asn Lys Asn Val Val Leu Lys Val Tyr Pro Asn Met 
85 90 95 

Thr Val Glu Ser Cys Ala Cys Arg 
100 
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(2) INFORMATION FOR SEQ ID NOi27: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 102 amino acids 

(B) TYPE i amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: HOMO SAPIENS 

(iz) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1..102 

(D) OTHER INFORMATION: /note- "BHP5" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

Cys Lys Lys His Glu Leu Tyr Val Ser Phe Arg Asp Leu Gly Trp Gin 
15 10 15 

Asp Trp He He Ala Pro Glu Gly Tyr Ala Ala Phe Tyr Cys Asp Gly 
20 25 30 

Glu Cys Ser Phe Pro Leu Asn Ala His Met Asn Ala Thr Asn His Ala 
35 40 45 

He Val Gin Thr Leu Val His Leu Met Phe Pro Asp His Val Pro Lys 
50 55 60 

Pro Cys Cys Ala Pro Thr Lys Leu Asn Ala He Ser Val Leu Tyr Phe 
65 70 75 80 

Asp Asp Ser Ser Asn Val He Leu Lys Lys Tyr Arg Asn Met Val Val 
85 90 95 

Arg Ser Cys Gly Cys His 
100 
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(2) INFORMATION FOR SEQ ID N0:28: 

(ij SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 amino acids 

(B) TYPE* amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(il) MOLECULE TYPE: protein 

(vi) ORIGINAL SOURCES 

(A) ORGANISMr HOMO SAPIENS 

(is) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1..102 

(D) OTHER INFORMATION: /note= "BMP6" 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:28; 

Cys Are Lys His Glu Leu Tyr Val Ser Phe Gin Asp Leu Gly Trp Gin 
I 5 10 15 

Asp Trp lie lie Ala Pro Lys Gly Tyr Ala Ala Asn Tyr Cys Asp Gly 
20 25 30 

Glu Cys Ser Phe Pro Leu Asn Ala His Met Asn Ala Thr Asn His Ala 
35 40 45 

lie Val Gin Thr Leu Val His Leu Met Asn Pro Glu Tyr Val Pro Lys 
50 55 60 

Pro Cys Cys Ala Pro Thr Lys Leu Asn Ala He Ser Val Leu Tyr Phe 
65 70 75 80 

Asp Asp Asn Ser Asn Val He Leu Lys Lys Tyr Arg Trp Met Val Val 
85 90 95 

Arg Ala Cys Gly Cys His 
100 
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(2) INFORMATION FOR SEQ ID NO:29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH i 102 amino acids 

(B) TYPE: anino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1..102 

(D) OTHER INFORMATION: /label- OPX 

/note- "WHEREIN XAA AT EACH POS'N IS INDEPENDENTLY 
SELECTED FROM THE RESIDUES OCCURRING AT THE 
CORRESPONDING POS'N IN THE C-TERMINAL SEQUENCE OF MOUSE 
OR HUMAN 0P1 OR 0P2 (SEE SEQ. ID NOS. 5,6,7 and 8 or 
16,18,20 and 22.)" 

(zi) SEQUENCE DESCRIPTION: SEQ ID NO:29: 

Cys Xaa Xaa His Glu Leu Tyr Val Xaa Phe Xaa Asp Leu Gly Trp Xaa 
1 5 10 15 

Asp Trp Xaa lie Ala Pro Xaa Gly Tyr Xaa Ala Tyr Tyr Cys Glu Gly 
20 25 30 

Glu Cys Xaa Phe Pro Leu Xaa Ser Xaa Met Asn Ala Thr Ash His Ala 
35 40 45 

He Xaa Gin Xaa Leu Val His Xaa Xaa Xaa Pro Xaa Xaa Val Pro Lys 
50 55 60 

Xaa Cys Cys Ala Pro Thr Xaa Leu Xaa Ala Xaa Ser Val Leu Tyr Xaa 
65 70 75 80 

Asp Xaa Ser Xaa Asn Val Xaa Leu Xaa Lys Xaa Arg Asn Met Val Val 
85 90 95 



Xaa Ala Cys Gly Cys His 
100 
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(2) INFORMATION FOR SEQ ID N0i30: 

(i) SEQUWCE CHARACTERISTICS : 

(A) LENGTH: 97 amino acids 

(B) TYPE: amino acids 

(C) TOPOLOGY: linear 

(ii) HOLECULE TYPE: protein 
(ix)FEATORE: 

(A) NAME:: Generic Sequence 5 

(D) OTHER INFORMATION: wherein each Xaa is independently 
selected from a group of one or more specified amino acids as 
defined in the specification. 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:30: 



Leu 


Xaa 


Xaa Xaa Phe 




1 


5 


Xaa 


Xaa 


Xaa Gly Trp Xaa Xaa Trp Xaa 






10 


Xaa 


Xaa 


Pro Xaa Xaa Xaa Xaa Ala 


15 




20 


Xaa 


Tyr 


Cys Xaa Gly Xaa Cys Xaa 




25 


30 


Xaa 


Pro 


Xaa Xaa Xaa Xaa Xaa 






35 


Xaa 


Xaa 


Xaa Asn His Ala Xaa Xaa 




40 


45 


Xaa 


Xaa 


Xaa Xaa Xaa Xaa Xaa Xaa 



Xaa 



Cys 



Xaa Xaa Xaa Xaa Xaa Xaa Cys 
55 60 
Xaa Pro Xaa Xaa Xaa Xaa Xaa 



Xaa 
70 
Xaa 



65 

Xaa Xaa Leu Xaa Xaa Xaa 



75 



Xaa Xaa Xaa Val Xaa Leu Xaa 



80 



Xaa 
85 
Xaa 



Xaa Xaa Xaa Met Xaa Val Xaa 



90 



Cys Xaa Cys Xaa 
95 
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(2) INFORMATION FOR SEQ ID NO: 31s 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 amino acids 

(B) TYPE: amino acids 

(C) TOPOLOGY: linear 

(ii) HOLECULE TYPE: protein 
(ix) FEATURE: 

(A) NAME: Generic Sequence 6 

(D) OTHER INFORMATION: wherein each Xaa is independently 
selected from a group of one or more specified amino acids as 
defined in the specification. 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

Cys Xaa Xaa Xaa Xaa Leu Xaa Xaa Xaa Phe 
15 10 
Xaa Xaa Xaa Gly Trp Xaa Xaa Trp Xaa 
15 

Xaa Xaa Pro Xaa Xaa Xaa Xaa Ala 
20 25 
Xaa Tyr Cys Xaa Gly Xaa Cys Xaa 
30 35 
Xaa Pro Xaa Xaa Xaa Xaa Xaa 
40 

Xaa Xaa Xaa Asn His Ala Xaa Xaa 
45 50 
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
55 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Cys 

60 65 
Cys Xaa Pro Xaa Xaa Xaa Xaa Xaa 
70 

Xaa Xaa Xaa Leu Xaa Xaa Xaa 

75 80 
Xaa Xaa Xaa Xaa Val Xaa Leu Xaa 
85 

Xaa Xaa Xaa Xaa Het Xaa Val Xaa 
90 95 
Xaa Cys Xaa Cys Xaa 
100 
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(2) INFORMATION FOB. SEQ ID N0:32: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1238 base pairs, 372 amino acids 

(B) TYPE: nucleic acid, amino acid 
(O STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ill) ORIGINAL SOURCE: 
(A) ORGANISM: human 
(F) TISSUE TYPE: BRAIN 

(iv) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 

(0) OTHER INFORMATION: 

/product- «GDF-1" 
/note- "GDF-1 CDNA" 

(x) PUBLICATION INFORMATION: 

(A) AUTHORS: Lee, Se-Jin 

(B) TITTLE: Expression of Growth/Differentiation Factor 1 

(C) JOURNAL: Proc. Nat'l Acad. Sci. 

(D) . VOLUME: 88 

(E) RELEVANT RESIDUES: 1-1238 

(F) PAGES: 4250-4254 

(G) DATE: Kay-l99l 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

GGGGACACCG GCCCCGCCCT CAGCCCACTG GTCCCGGGCC GCCGCGGACC CTGCGCACTC 60 

TCTGGTCATC GCCTGGGAGG AAG ATG CCA CCG CCG CAG CAA GGT CCC TGC GGC 113 

Met Pro Pro Pro Gin Gin Gly Pro Cys. Gly 
1 5 10 

CAC CAC CTC CTC CTC CTC CTG GCC CTG CTG CTG CCC TCG CTG CCC 158 
His His Leu Leu Leu Leu Leu Ala Leu Leu Leu Pro Ser Leu Pro 
15 20 25 

CTG ACC CGC GCC CCC GTG CCC CCA GGC CCA GCC GCC GCC CTG CTC 203 
Leu Thr Are Ala Pro Val Pro Pro Gly Pro Ala Ala Ala Leu Leu 
30 35 40 

CAG GCT CTA GGA CTG CGC GAT GAG CCC CAG GGT GCC CCC AGG CTC 248 
Gin Ala Leu Gly Leu Arg Asp Glu Pro Gin Gly Ala Pro Arg Leu 
45 50 55 
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CGG COG GTT CCC CCG GTC ATG TGG CGC CTG TTT CGA CGC CGG GAC 293 
Arg Pro Val Fro Pro Val Met Trp Arg Leu Phe Arg Arg Arg Asp 
60 65 70 

CCC CAG GAG ACC AGG TCT GGC TCG CGG CGG ACG TCC CCA GGG GTC 338 
Pro Gin Glu Thr Arg Ser Gly Ser Arg Arg Thr Ser Pro Gly Val 
75 80 85 

ACC CTG CAA CCG TGC CAC GIG GAG GAG CTG GGG GTC GCC GGA AAC 383 
Thr Leu Gin Pro Cyc His Val Glu Glu Leu Gly Val Ala Gly Asn 
90 95 100 

ATC GTG CGC CAC ATC CCG GAC CGC GGT GCG CCC ACC CGG GCC TCG 428 
lie Val Arg His lie Pro Asp Arg Gly Ala Pro Thr Arg Ala Ser 
105 110 115 

GAG CCT GTC TCG GCC GCG GGG CAT TGC CCT GAG TGG ACA GTC GTC 473 
Glu Pro Val Ser Ala Ala Gly His Cys Pro Glu Trp Thr Val Val 
120. 125 130 

TTC GAC CTG TCG GCT GTG GAA CCC GCT GAG CGC CCG AGC CGG GCC 518 
Phe Asp Leu Ser Ala Val Glu Pro Ala Glu Arg Pro Ser Arg Ala 
135 140 145 

CGC CTG GAG CTG CGT TTC GCG GCG GCG GCG GCG GCA GCC CCG GAG 563 
Arg Leu Glu Leu Arg Phe Ala Ala Ala Ala Ala Ala Ala Pro Glu 
150 155 160 

GGC GGC TGG GAG CTG AGC GTG GCG CAA GCG GGC CAG GGC GCG GGC 608 
Gly Gly Trp Glu Leu Ser Val Ala Gin Ala Gly Gin Gly Ala Gly 
165 170 175 

GCG GAC CCC GGG CCG GTG CTG CTC CGC CAG TTG GTG CCC GCC CTG 653 
Ala Asp Pro Gly Pro Val Leu Leu Arg Gin Leu Val Pro Ala Leu 
180 185 190 

GGG CCG CCA GTG CGC GCG GAG CTG CTG GGC GCC GCT TGG GCT CGC 698 
Gly Pro Pro Val Arg Ala Glu Leu Leu Gly Ala Ala Trp Ala Arg 
195 200 205 

AAC GCC TCA TGG CCG CGC AGC CTC CGC CTG GCG CTG GCG CIA CGC 743 
Asn Alai Ser Trp Pro Arg Ser Leu Arg Leu Ala Leu Ala Leu Arg 
210 215 220 

CCC CGG GCC CCT GCC GCC TGC GCG CGC CTG GCC GAG GCC TCG CTG 788 
Pro Arg Ala Pro Ala Ala Cys Ala Arg Leu Ala Glu Ala Ser Leu 
225 230 235 

CTG CTG GTG ACC CTC GAC CCG CGC CTG TGC CAC CCC CTG GCC CGG 833 
Leu Leu Val Thr Leu Asp Pro Arg Leu Cys His Pro Leu Ala Arg 
240 245 250 
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CCG CGG CGC GAC GCC GAA CCC GTG TTG GGC GGC GGC CCC GGG GGC 878 
Pro Arg Arg Asp Ala Glu Pro Val Leu Gly Gly Gly Pro Gly Gly 
255 260 265 

GCT TGT OGC GCG CGG CGG CIG TAC GTG AGC TIC CGC CAG GTG GGC 923 
Ala Cys Arg Ala Arg Arg Leu Tyr Val Ser Phe Arg Glu Val Gly 
270 275 280 

TGG CAC CGC TGG GIC ATC GCG CCG CGC CCC TEC CIG GCC AAC TAC 968 
Trp His Arg Trp Val lie Arg Pro Arg Gly Phe Leu Ala Asn Tyr 
v 285 290 295 

TGC CAG GGT CAG TGC GCG CTG CCC GTC GCG CTG TCG GGG TCC GGG 1013 
Cvs Gin Gly Gin Cys Ala Leu Pro Val Ala Leu Ser Gly Ser Gly 
r 300 305 310 

GGG CCG CCG GCG CTC AAC CAC GCT GTG CTG CGC GCG CTC ATG CAC 1058 
Gly Pro Pro Ala Leu Asn His Ala Val Leu Arg Ala Leu Met His 
' 315. 320 325 

GCG GCC GCC CCG GGA GCC GCC GAC CTG CCC TGC TGC GTG CCC GCG 1103 
Ala Ala Ala Pro Gly Ala Ala Asp Leu Pro Cys Cys Val Pro Ala 
330 335 340 

CGC CTG TCG CCC ATC TCC GTG CTC TTC TTT GAC AAC AGC GAC AAC 1148 
Arg Leu Ser Pro lie Ser Val Leu Phe Phe Asp Asn Ser Asp Asn 
345 350 355 

GTG GTG CIG CGG CAG TAT GAG GAC ATG GIG GTG GAC GAG TGC GGC 1193 
Val Val Leu Arg Gin Tyr Glu Asp Met Val Val Asp Glu Cys Gly 
360 365 370 

TGC CGC TAACCCGGGG CGGGCAGGGA CCCGGGCCCA ACAATAAATG CCGCGTGG 1238 
Cys Arg 

372 ( 
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(34) INFORMATION FOR SEQ ID NO: 33: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 372 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: human 
(F) TISSUE TYPE: BRAIN 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 

(D) OTHER INFORMATION: /function- 
/product- "GDF-l" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

Het Pro Pro Pro Gin Gin Gly Pro Cys Gly 
1 5 10 

His His Leu Leu Leu Leu Leu Ala Leu Leu Leu Pro Ser Leu Pro 
15 20 25 

Leu Thr Arg Ala Pro Val Pro Pro Gly Pro Ala Ala Ala Leu Leu 
30 35 40 

Gin Ala Leu Gly Leu Arg Asp Glu Pro Gin Gly Ala Pro Arg Leu 
45 50 55 

Arg Pro Val Pro Pro Val Het Trp Arg Leu Phe Arg Arg Arg Asp 
60 65 70 

Pro Gin Glu Thr Arg Ser Gly Ser Arg Arg Thr Ser Pro Gly Val 
75 SO 85 

Thr Leu Gin Pro Cyc His Val Glu Glu Leu Gly Val Ala Gly Asn 
90 95 100 

lie Val Arg His lie Pro Asp Arg Gly Ala Pro Thr Arg Ala Ser 
105 110 115 
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Gltt Pro Val Ser Ala Ala Gly His Cys Pro Glu Trp Thr Val Val 
120 125 130 

Phe Asp Leu Ser Ala Val Glu Pro Ala Glu Arg Pro Ser Arg Ala 
135 140 145 

Are Leu Glu Leu Arg Phe Ala Ala Ala Ala Ala Ala Ala Pro Glu 
150 155 160 

Gly Gly Trp Glu Leu Ser Val Ala Gin Ala Gly Gin Gly Ala Gly 
165 170 175 

Ala Asp Pro Gly Pro Val Leu Leu Arg Gin Leu Val Pro Ala Leu 
180 185 190 

Gly Pro Pro Val Arg Ala Glu Leu Leu Gly Ala Ala Trp Ala Arg 
' 195 200 205 

Asn Ala Ser Trp Pro Arg Ser Leu Arg Leu Ala Leu Ala Leu Arg 
210 215 220 

Pro Arg Ala Pro Ala Ala Cys Ala Arg Leu Ala Glu Ala Ser Leu 
225 230 235 

Leu Leu Val Thr Leu Asp Pro Arg Leu Cys His Pro Leu Ala Arg 
240 245 250 

Pro Arg Arg Asp Ala Glu Pro Val Leu Gly Gly Gly Pro Gly Gly 
255 260 265 

Ala CVs Arg Ala Arg Arg Leu Tyr Val Ser Phe Arg Glu Val Gly 
270 275 280 

Trp His Are Trp Val lie Arg Pro Arg Gly Phe Leu Ala Asn Tyr 
v 285 290 295 

Cys Gin Gly Gin Cys Ala Leu Pro Val Ala Leu Ser Gly Ser Gly 
' 300 305 310 

Gly Pro Pro Ala Leu Asn His Ala Val Leu Arg Ala Leu Met His 
' 315 320 325 

Ala Ala Ala Pro Gly Ala Ala Asp Leu Pro Cys Cys Val Pro Ala 
330 335 340 

Arsr Leu Ser Pro lie Ser Val Leu Phe Phe Asp Asn Ser Asp Asn 
345 350 355 

Val Val Leu Arg Gin Tyr Glu Asp Met Val Val Asp Glu Cys Gly 
360 365 370 



Cys Arg 
372 
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What is claimed isx 

1. A method of screening candidate compounds for the 
ability to modulate the effective concentration of a 
morphogen in an organism, said method comprising 

incubating a candidate compound with cells 
from a test tissue type known to produce a morphogen for a 
time sufficient to allow said compound to affect the 
production of said morphogen, and 

assaying said cells for a parameter 
indicative of a change in the level of production of said 
morphogen. 

2. The method of claim 1 wherein said morphogen 
is OP-1. 

3* The method of claim 2 wherein said test tissue type 
is a human renal-derived tissue* 

4 • The method of claim 3 wherein said renal-derived 
tissue is a kidney or bladder-derived tissue. 

5. The method of claim 2 wherein said test tissue type 
is adrenal-derived tissue. 

6. The method of claim 1 wherein said morphogen is 
GDF-1. 

7. The method of claim 6 wherein said test tissue type 
is derived from human nerve tissue. 
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8. The method of claim 7 wherein said nerve tissu is 
brain-derived tissue. 

9. The method of claim 1 wherein said morphogen is 

DPP. 

10. The method of claim 9 wherein said test tissue 
type is derived from one of the following drosophila 
tissues: dorsal ectoderm, epithelial imaginal disc visceral 
mesoderm, or gut endoderm. 

11. The method of claim 1 wherein said morphogen is 
Vgr-1. 

12. The method of claim 11 wherein said test tissue 
type is mouse lung tissue. 

13. The method of claim 1 wherein said morphogen is 

Vgl. 

14. The method of claim 13 wherein said test tissue 
type is xenopus fetal endoderm tissue. 

15. A method of assessing a tissue of an organism for 
its level of production of a morphogen and for screening 
candidate compounds for the ability to modulate the 
effective concentration of said morphogen produced by cells 
of said tissue, said method comprising 

selecting a test tissue type producing a high 
level of morphogen relative to the level of morphogen 
produced by other tissue types? 
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incubating a candidate compound with cultured 
cells of said selected tissue type for a time sufficient to 
allow said compound to affect the production of said 
morphogen; and 

assaying said selected tissue cells for a 
parameter indicative of a change in the level of production 
of said morphogen. 

16* The method of claim 1 or 15 wherein said parameter 
indicative of the level of said morphogen is determined 
using an antibody specific for said morphogen. 

17. The method of claim 1 or 15 wherein said parameter 
indicative of the level of said morphogen is determined by 
measuring cellular proliferation in cells which are 
sensitive to the concentration of secreted OP-1. 

. 18. The method of claim 1 or 15 wherein said parameter 
indicative of the level of said morphogen is determined 
using a nucleic acid probe that hybridizes under stringent 
conditions with nucleic acid encoding said morphogen. 

19. The method of claim 18 wherein said morphogen 
comprises a minimally active core C-terminal region 
comprising at least six cysteine residues, and said nucleic 
acid probe hybridizes with an mRNA encoding a region N- 
terminal to said core region. 

20. The method of claim 18 wherein said morphogen 
comprises a minimally active core C-terminal region 
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comprising at 1 ast six cysteine r sidu s, and said nucleic 
acid probe hybridizes with an mRNA encoding a region 3' to 
said core region. 
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